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ABSTRACT 

One of a regular series on the status and progress of 
studies into the nature of speech, instrumentation for its 
investigation, and practical applications of research, this report 
covers th* period of January 1 to March 31, 1982. The 13 studies deal 
with the following topics: (1) speech perception and memory coding in 
relation to reading ability, (2) the use of orthographic structure by 
deaf adults, (3) information support for speech, (4) the stream of 
speech, (5) using the acoustic signal to make inferences about place 
and duration of tongue-palate contact, (6) the patterns of human 
interlimb coordination that have emerged from the properties of 
nonlinear limit cycle oscillatory processes, (7) motor control, (8) 
the nature of motor control in Down's syndrome, (9) periodicity and 
auditory memory, (10) reading skill and language skill, (11) the role 
of sign order and morphological structure in memory for American Sign 
Language sentences, (12) perception of nasal consonants with special 
reference to Catalan, and (13) speech production characteristics of 
the hearing impaired. (FL) 
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SPEECH PERCEFriON AND MMOR1 CODING IN DELATION TO READING ABILITY 
Susan Brady, Donald Shankweiler,+ and Virginia Mann*+ 



Abstract . Previous work has demonstrated that children wh9 are poor 
^eaders have short-term memory deficits in tasks in which the 
stimuli lend themselves to phonetic coding. The aim of* the present 
study was to' explore whether the poor readers' memory deficit may 
have its origin in perception with the encoding of the stimuli. 
Three experiments were conducted with third-grade good and 4 poor 
readers. As in earlier experiments, the poor readers were found to 
perform less well on recall of random' word strings, and to be less 
affected by the phonetic characteristics (rhyming or not rhyming) of 
the items (Experiment 1). In addition, the poor readers produced 
more* errors of transposition (in the nonrhyming strings) than did 
the good readers, a further indication of the poor readers' problepu 
with memory for order. The subjects were tested on two auditory 
perception tasks, one employing words (Experiment 2) and the other 
nonspeech environmental sounds (Experiment 3)* Each was presented 
under two ;jconditions: with a favorable signal- to- noise ratio and 
with masking. The pocr readers made significantly more -errors than 
the good , readers when listening to speech in noi-se, but did not 
differ 'in perception of speech without noise or in perception of 
nonspeech environmental sounds, whether noise-masked or not. 
Together, the results of the perception studies suggest that poor 
readers have a perceptual difficulty that is specific to speech. It 
is suggested that the short- ter*a memory deficits characteristic of 
poor readers may stem from material-specific problems of perceptual 
processing. * >v 
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Many studies hav£ shown that children who are poor readers tend to 
perform deficiently on short-term memory tasks* There is considerable evi- 
dence, however; ihat the memory jaflp blem is specific to linguistic material and 
to <?ther material that lends ^pR*lf to linguistic representation. A hypo- 
thesis has been proposed that failure to make effective use of phonetic coding 
in short-term memory may account for some of the deficiencies poor readers 
typically show in language processing (Liberman, Shankweiler, Liberman, 
Fowler; & Fischer, 1977). Tests of this hypothesis have utilized the well- 
known phenomenon that when normal .adult subjects are required to recall 
strings of rhyming and nonrhyming letters or words, man^ more errors typically 
occur on the rhyming strings (Baddeley, 1 966; * Conrad , M 964, 1972). Children 
who are good readers, like normal adults* tend to be strongly affected by 
rhyme; poor readers, on the other hand, are significantly less, affected . For 
them, phonetic similarity has relatively little effect on recall (Liberman et~ 
al. f 1977). 

Subsequent experiments have confirmed and extended this result under a 
variety of conditions: when memory is tested by recognition as well as when 
it is tested by recall (Byrne & Shea, 1979; Mark, Shankweiler, Liberman, & 
Fowler, 1977); when sentences or word strings are the stim,uli as well^fes when 
letter 7 'strings are presented (Mann, Liberman, <& Shankweiler, I960); when the 
items are presented auditorily instead of visually (Shankweiler, Liberman, 
Mark, Fowler, & Fischer, 1979). In each of these conditions it was found that 
poor readers are relatively insensitive to the phonetic characteristics of the 
items. Accordingly, it has been supposed that poor readers have a general 
problem with the use of a phonetic code, however 'the material is presented, 
and not a specific difficulty in deriving a phonetic representation from print 
(Shankwe i"! pr & Ljberman, 1976). It would seem, therefore, that one reason for 
poor readers 1 deficient performance in sbort-term memory tasks is their 
failure to fully exploit phonetic coding. 

It remains to be determined what limits full utilization of phonetic 
codes by poor readers. To what extent' does* the problem arise in perception 
with the encoding of stimuli, and to what extent does the problem involve the 
use of information already represented in phonetic form? Our intent in this 
study was to investigate whether the poor readers' phonetic- coding deficiency 
in short-term memory is related to the perceptual process as such. 

A study by Rabbitt (-1968) gives a way to understand how such a 
relationship might come about. This sstudy points to a direct connection 
between -stimulus variables that affect perception and those that affect 
recall. In Rabbitt 1 s experiment, the subjects were required to listen, to, 
spoken d igi ts presented with a white noise mask. In one condition the 
subjects' task was to repeat individual items, in another condition they were 
tested for recall of strings of items. It was found that noise levels that 
produced no manifest effect on perception and recall ojf the individual items 
significantly impaired recall of the strings. Thus adding noise, and increas- 
ing the perceptual difficulty, adversely affected memory even when the 
individual items could still be identified correctly. The insight we gain 
from Rabbitt 1 s findings may give i^s a purchase on the problem of why poor 
readers typically reveal deficits in verbal short-term memory. Their failure 
to make full use of phonetic coding in short-term memory may be traceable, as 
Perfetti and Lesgold have supposed (1979), to a disorder at the level of 
perceptual processing. 



It is well known that severe reading problems often occur in children who 
show no obvious abnormalities in language development. These poor readers 
typically do not manifest clinically* apparent difficulties in perception of 
speech. It is conceivable, howeVer, that such children may have subtle 
deficiencies in speech perception that special testing procedures may bring to 
light. . . 

One study (Goetzinger, Dirks, & Baer, 1960) hints that in order to 
discern differences in perceptual skills among good and poor readers it may be 
necessary to use a quite demanding tasK Goetzinger et al. reported no 
difference-between reading groups for a list of well- articulated words but a 
significant'fcifference in favor of the good readers on a list of rapidly, and 
somewhat indistinctly; articulated items. Although the study does not permit 
a. direct comparison to be made ( different ' words occurred in^ the two test 
lists),, the results suggest that discrepancies in' speech .perception abiliti.es 
may have been present for good and poor readers that would be detected on a 
sufficiently difficult task. 1 _ 

-\ 

AJLtboiigh relevant data, are scarce^, there is reason to suggest that the. 
characteristic differences so often observe^ between good and poor readers on 
memory task's might be associated with differences in speech perception. Our 
purpose in the research we ' present here was to examine this po-ssibil ity. 
Accordingly, good "and poor readers were tested on a memory task in which the 
effects of phonetic* coding are known, to be discernible. Ueing the procedure 
of Liberman et al. (1977), we compared performance on recall of phonetically 
similar* (rhymjLng) and phonetically dissimilar (nonrhyming) sequences of mono- 
syllabic words in good and poor readers. It was expected that, as .in previous 
experiments, good readers, in contrast to poor, , would find recall of the 
rhyming sequences more difficult than the nonrhyming sequences, reflecting 
more efficient use of a phonetic • code. We then addressed the question of 
Whether the reading group differences on memory tasks are related to speech 
perception abilities. The subjects were tested on a speech perception task 
requiring repetition of monosyllabic words. The items selected included high 
and low frequency words phonetically balanced to permit phonetic analysis of 
errors and examination of .error location within the syllable. The stimuli 
were presented under two conditions, with and without masking noise, in order 
"to vary the difficulty of the task. In addition, a test of perception of 
environmental nonspeech sounds was conducted, again with and without noise 
masks, to enable us to investigate any differences in perceptual performance 
•that exist beyOrid* the speech domain. 

# \ 

n 

METHOD . " , 

Subjects * 

The subjects were third-grade children from a suburban public school in 
southern Rhode Island. A school reading specialist was asked to, select the 
poorest readers and the good readers from the third-grade classes. The 
children were given the Word Attack and Word Recognition subtests of the 
Woodcock Reading Mastery Tests, Form A (Woodcock, 1973), and a test of 
receptive vocabulary, the Peabody Picture Vocabulary Test (PPVTj.Dunn, 1965). 
On- the basis of scores obtained on the Woodcock test, 'two groups were formed 
that were non- overlapping in reading level. 



Eight children were eliminated because their inconsistent scores on the 
two Woodcock sub- tests made them difficult to classify^ as good or poor 
readers. Three additional selectio^ criteria were employed to determine 
eligibility for participation in the experiments. First, in order to restrict 
the range of vocabulary skills, only those children were select.ed whos3 PPVT 
IQ score fell between 90 and 120. An additional five children failed to meet 
this requirement. Second, in view of the evidence that the speech perception 
skills of children continue to develop during elementary school years (Finken- 
binder, 1973; Gbldman, Fristoe, & Woodcock, 1970; Schwartz & Goldman, 1974; 
Thompson, 1 963-) , subjects were selected whose ages fell within a limited range 
(96 to i 08 months) . The age requirement excluded five more potential 
subjects. And third, the remaining ^IrMren were screened for hearing loss. 
The right and left ears were presented with tones at 500 Hz (25 db) , 1000 Hz 
(20 dB) , 2000 Hz '(20 dB) , 4000 Hz (20dB)~, and " 8000 Hz (20 dB) , using a 
standard Audiometer. Seven children failed the hearing screening. 

Thirty 6hildren met all the requirements for participation in the study. 
Table 1 summarizes the characterises of the good and poor reader groups. 
The 15 children who qualified as good readers were well ahead^ of third grade 
reading skills with a mean reading grad^pP6vel of 5.88* The 15 children 
labelled ipoor readers averaged sligjiily more than one-half year below their 
expected level (with a mean reading grade level of 2.76). 



^ r lr~ 7 * 

• v , Table 1, - • - 

Means for Tfiird-Grade^Cbildren Grouped According tQ Reading Achievement 

Group Age ' IQa , * Reading G'rade b . % 

Good 15' 8 yr. 5 mo. , 106.8 5.88 . 

Poor 15^ b yr. 6 mo. 102.5 2.76 



a Peabody Picture Vocabulary' Test 

bFrom the' average of the reading grade scores obtained on the Word Attack and 
Word Recognition subtests"*^ the Woodcock Reading Mastery Tests, Form A. 



The ages of the £ood (mean - 8 yr. 5 mo.) ancf >t»or readers (mean = 8 
yr-." 6 mo.)' did not differ significantly* Nor were M;he IQ scores as assessed 
by the PPVT significantly different. 7 The mean IQ score for the good readers 
was 106. 8,' for t^e poor readers 1'02.5* ^ 

Procedure ; - • \ • 

'^Each child was tested ^ ind ividually for" three 'sessions. The first 
session included- the screening procedure, the speech perception noise-masked 
condition and one half ( se.t A as explained below) of the memo'ry .experiment. 
The second 3ession, occurring at least a week later, consisted -of the speech 
perception unmasked condition and the other half (set B\ .of ' the m memory 
experiment. - The third session, approximately' tw<? months after the -first, was 
devoted to the environmental-bounds experiment. 

* The ..experiments were conducted in a quiet room. ' The v tape-recorded 
material Tor the memory, speech perception, and environmental sounds tasks 
was played to subjects over earphones. VThe subjects' ' responses, were^ recorded 
on audiotape. Transcriptions of the subjects' responses were also made 
during the testing sesbion.' Th e tapes were played back within an hour of the 
experimental session in order to corroborate the transcription and to allow 
any necessary corrections. 

EXPERIMENT 1: Susceptibility of Go'od and Poor Readers to Phonetic 
' Confusions in Short-term Memory 

The firsj experiment employed § short-term memory task with rhyming and 
nonrhyming word strings. Our aim was to confirm previous evidence that j>oor 
readers'make less effective use of phonetic coding in short- t,erm memory than 
do good readers. 

Stimuli 

'Twenty strings of five monosyllabic < words were created, ten rhyming and- 
ten nonr-hyming? A single list of 50 common nouns was used ati the word source 
for the rhyming and nonrhyming tests. Thus word frequency, phonetic struc- 
ture, and wbrd length were strictly controlled for the two conditions. The 
five words' 'in each rhyming string ° had ' the <same vowel and the same final 
consortant if any. The five words in each nonrhyming string all, had different 
vowels and final consonant. 

The twenty strings were recorded on magnetic tape in two sets tA and B) 
of ten lists read by a phonetically-trained male speaker. * Each set comprised 
an alternating presentation of rhyming and nonrhyming strings. Within each 
string 4jhe 'items were spoken with a neutral prosody at ,the rate of one per 
second. The two sets are presented in Table 2. • 

Procedure 



Each subject heard, set A during the first session and set B during the 
second. On both occasions the same procedure was followed. 
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neat 


wheat 


sheet 


feet 


treat 



12 



The child was told that a list of words would be played and that the 

ta3k was to repeat the list in the order given v After practicing with two 

lists read by the experimenter, the subject then heard the pre-recorded set 
of ten five- item vord strings • 

Results and Discussion 

First, an analysis was made of the correct responses in terms of item 
recall and serial order. Secondly, the errors were analyzed qualitatively in 
relation to phonetic structure of the stimulus words. 

Analysis of Correct Responses 

The subjects* responses were scored in two ways. In the first proce- 
dure, a response was considered correct only if the item was accurately 
reported and if it was assigned to -he appropriate serial position. The 
second procedure ignored serial position and counted as correct all responses 
of words that had occurred in the given string, regardless of order of 
report. 

The error data for each scoring procedure (summarized in Table }) were 
•subjected to analysis of variance. We examine first the results from the 
more strict scoring procedure.' In agreement with earlier studies (Naidoo, 
M970; Miles & MilesT, 1977; Shankweiler et al., 1979; Mann et al., 1980) the 
overall, accuracy of recall was greater for good readers, F(1,28) = 5*6, 
p = .025.. {There was as expected, a significant effect oT list type, 
"F(1,28) = 44*.2, p < .001. And, as predicted, the good readers made fewer 
errors on the nanrhyming word sequences than on the rhyming. The poor 
readers also showed an effect, though a smaller one; of phonetic similarity. 
Thus, while we obtained significant effects' of reader group and of list type 
that conformed to the pattern of earlier studies ( Shankweiler. et al., 1979; 
Mann' et ai . , : 1980), the interaction between reading group and list type did 
not reach significance, JF( 1 , 28) = 2.9, JP = *098. 



Table 3 

Experiment 1: Mean Number Correct Summed Over Serial Positions2 
for Strict Order Scoring and for Order Free Scoring 



Order Correct Scoring Order Free Scoring 

Rhyme Non-Rhyme Rhyme Non-Rhyme 

Good 15-8 28.0 ■ 32-7 , 35-5 

Poor 12.2 19.4 f 31-7 29-5 

difference 3*6 8.6 1*0 6.0 
2Maximum = 50 . 



Table 4 



Experiment 1: Analysis, of Incorrect Responses 



incorrect responses 
total responses 



Reading 

Group Condition 



Good-' Rhyme 



Nonrhyme 



Poor Rhyme 



Nonrhym* 



15 



17 



16 



26 



Composition of incorrect responses: 
The percentage with:a 

vowel and 
initial consonant final consonant 
initial final and vowel from from same 

consonant vowel consonant same word word 



66 



81 



59 



79 



99 



97 



99 



98 



98 



97 



97 



92 



19 



16 



79 
68 

75 
73 



ai.e., the percentage with phonetic information that was available in 
the two strings. 



14 



15 



Evidence that the two reading groups differed in the recall strategies 
they employed emerges when the data were re-examined after applying the more 
lenient scoring procedure. As in other studies utilizing lists of high intra- 
list similarity, item information suffers less than order information. So for 
both groups the order-free recall scores are markedly higher, particularly for 
the rhyming strings. Overall, the performance level of the two reading groups 
was not significantly different, F(l,28) = 3.6, j> = *071 , nor was^ there a main 
effect of rhyme, j(l,28) - .1, j> > '500. In Table 3 we can see, however, that 
while the scores for the two groups were very close in the rhyming condition, 
they were dissociated on the nonrhyming sequences. Thus, we find a signifi- 
cant interaction between reading group and list type, J?(1,28) = 6.7, 
b = .016). The good readers showed improved - performance in the nonrhyming 
condition, _F(1 ,28) = 4.2, p = .05, where an efficient phonetic strategy can 
operate to advantage. The"~poor read res, in contrast, did not improve on the 
nonrhyming sequences, _F(1,28) = 2.6, p < .20; indeed they tended to do worse. 

The memory experiment undertaken here was intended mainly as a replica- 
tion. In previous research, good readers evidenced generally- superior recall 
but were relatively more penalized by phonetic similarity within a list than 
were poor readers. The present study does generally conform to this picture, 
though here the differences between the groups were somewhat less marked, 
perhaps because the subjects were a year older than those in the earlier 
research. At present, the appropriate studies to examine developmental 
changes in use of a phonetic strategy have not been done. If poor readers are 
employing a nor* phonetic strategy, as has been suggested (see Byrne & Shea, 
1979), we might expect their use of this strategy to diminish with increasing 
age (Conrad, 1972). 

Qualitative Analysis of Errors 

The construction of the present experiment, using words as stimuli rather 
than letters, permits a closer inspection of the nature of the difficulty poor 
readers , have in preserving order information. In analyzing the response 
sequences, it became apparent that the recall problems of poor readers apply 
not only to the order of the stimuli in a string but- also to the retention of 
phonemic sequences within individual words. The subjects' . response sequences 
(for both good and poor readers) included items that had not occurred in the 
strings. These errors were often obvious recombinations of phonetic compo- 
nents that had been present in the presented sequence (e.g., for the target 
items train and plate several subjects reported tfait and plane ) . Such errors 
of transposition have previously been reported in memory experiments with 
adults (Drewnowski, 1980; Ellis, 1980). We undertook to analyze the phonetic 
errors in the present experiment to determine how often the incorrect 
responses could be accounted for as transposed phonetic segments from adjacent 
items. In this analysis, the given string and the previous sequence were 
considered as the available source of phonetic information. 

The data base for determining whether errors of transposition we>re 
present was the 451 phonetic errors obtained from all 30 subjects. Seven of 
these errors were whole words from previous lists and were disregarded. An 
additional seven were discounted because they were phonetically unrelated to 
any item in either word list. The phonetic composition of the remaining 437 
responses could, for the most part, be accounted for in terms of the phonetic 



units present in the particular string and the preceding string. In Table 4 
we present a breakdown of the transposition errors. Good and poor readers 1 
transposition errors were very similar in pattern* When a phonetic unit was 
-transposed, it was recombined in the same syllable position in which it had 
originally occurred. Most commonly, vowel and final consonant (or consonant 
cluster) were preserved as a unit with a substituted initial consonant (or 
consonant cluster) . (Table 5 lists a representative sample of the observed 
error responses.) This error pattern suggests that phonetic segments are not 
equally free to dissociate and recombine in memory. If they did operate as 
independent units on recombination, there would be no reason to expect greater 
cohesion between the vowel and the final consonant than between the initial 
consonant and the vowel. 



Table 5 

Experiment 1: Examples of Transposition Errors 
Presented I terns Responses 



roar + fat rat 

bear + shell bell 

score + cat scat 

knee + state neat 

chair + pain chain 

hair + spell hellv 

spell < fate spate 

pie + feat peat 

tea + brain tain 



To ascertain whether the incidence of transposition errors differentiates 
the reading groups, an analysis of variance was carried out on the proportion 
of transposition errors to correct responses for the rhyming and nonrhyming 
conditions. The overall proportion of transposition errors to correct res- 
ponses did not differ significantly for the two reading groups, ?{) ,28) = 1 .8, 
jj 58 .194. -However, while both groups produced a higher proportion of 
transposed responses in the nonrhyming condition,' the difference was moj^e 
pronounced for the poor readers. 'These effects are manifested by a sigBltfi^ 
cant effect of list type , _F(1 , 28) = 10.4, j> = .004, and by a signifxrotft 
interaction between list type and reading group, jKl ,28) = 4.9, j> ■ .036. 
Thus it seems that the greater difficulty poor readers have in retaining the 
order of words in the nonrhyming sequences may be compounded by a problem with 
the preservation of order information within a word. In the case of the 
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rhyming strings, of course, subjects may well produce transposed responses 
that would be undetectable. This may accbunt for the- better performance of 
the poor readers in the order- free scoring of rhyming words. 

The present study confirms earlier reports that poor readers, recall fewer 
items than good readers and that they are less affected by phonetic similarity 
within a list than are good readers (Liberman et al., 1977; Mann *t al., 1980; 
Mark et al., 1977; Shankweiler et al., 1979)- In this study the result of the 
phonetic error analysis allows us to extend our understanding of poor readers 
performance on memory tasks. It indicates first.of all that the poor readers 
definitely obtained the phonetic information in the stimuli. However, the 
greater incidence of transposition errors by poor readers (in the nonrhyming 
condition) also points, to inferior retention of the correct combinations of 
phonetic sequences specifying the inc^vidual items. This finding is consis- 
tent with other indications (Katz, Sh&nkweiler, & Liberman, in press) that 
poor readers encounter difficulty in preserving serial order information in 
linguistic tasks. It further suggests that the problem extends to the 
ordering of segments within the syllable. 

EXPERIMENT 2: Speech Perception in Good and Poor Readers 

We now turn to the second question: the speech perception abilities of 
the good and poor readers. The aim of Experiment 2 was to investigate whether 
the language deficits of the poor reader are evident in phonetic perception as 
wel c l as in short-term memory. 

Stimuli 

The perception vtest consisted of 48 words especially chosen to control 
for syllable pattern, phonetic composition, and word frequency. There were 12 
words for each of the following syllabic patterns: CVC ( consonant- vowel- 
consonant) , CCVC, CCVCC,' and CVCC. Within each syllable pattern, half of the 
words selected were judged to have high frequency of occurrence in children's 
-literature and half had low frequency (Carroll, Davies, & Richman, 1971). The 
frequency values were validated with a second word frequency source (Thorndike 
& Lo.vge, 1944). 

; n "order to permit a clearcut analysis of phonetic errors and of errors 
of position (i.e., initial, medial and final word position) words were chosen 
to provide a systematic phonetic set. Twenty words began with stop consonants 
(/b/, /d/, /g/, /p/, /t/, /k/) and twenty words began with fricatives or 
affricates (/tj/, /s/, /f/<, /J/, /d 3 / , /v/) . For each of the above phonetic 
categories half of the occurrences were in high frequency words and half were 
in low frequency words. Of the remaining eight items, four began with nasal 
consonants (/m/', /n/) and four with liquids (/r/, /l/). The same distribution 
of phonetic elements occurred in word final position. 1 

The occurrences of segments in medial position were not controlled except 
in one respect: every syllabic pattern that occurred in a high frequency word 
was matched in a low frequency word (e.g., front [high frequency] and flint 
[low frequency] were matched in syllabic pattern: each consisted of the 
sequence: fricative, liquid, vowel, nasal consonant, stop consonant). The 
word list is presented in Table 6. 
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Table 6 



Experiment 2: Speech Stimuli 



High Frequency Words 



Lpv F requency Words 



door 


bal e 


team 


din 


road 


lobe 


knife 


mash 


chief 


chef 


job 


fig 


grain 


tram 


breath 


grouse 


crowd 


crag 


sleep 


slag 


sc ale 


spire 


speech 


skiff 


front 


flint 


plant 


clamp 


friend 


frond 


clouds 


glades 


blocks 


drapes 


planes 


prunes 


bank 


kink 


chance 


finch 


list * , 


rasp 


month 


nymph 


child 


' vault 


ships 


shacks 



is 
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The words were recorded by a phonetically-trained male speaker, .each 
toeing produced as the final word of a meaningful sentence* The sentences were 
subsequently digitized at 10,000 samples/sec and each stimulus word was 
excised from . the rest of the sentence, using the Haskins WENDY waveform 
editing system (Szubowicz, Note 1). The words were then arranged into a fixed 
random sequence and recorded cr*to magnetic tapfe. When the stimuli were 
replayed,, a comfortable listening level was selected, approximately 78 dB SPL. 

The noise-masked condition" was then constructed by following the method 
.described by Schroeder ( 1 968) • The technique involves computing the masking 
noise signal directly from the digitized speech sample to be' 1 masked. Each 
speech sample of the digitized waveform of. a stimulus word is multipled by 
another, randomly chosen with equal probability. The waveform that results 
from this manipulation preserves the time-varying amplitude characteristics of 
the speech signal while having a flat long-term frequency spectrum. Thus it 
is referred to as an amplitude-match noise signal. Each digitized word and 
its amplitude matched' noise signal were added linearly to yield a 0 dB S/N 
ratio. The words in noise were subsequently arranged into a fixed random 
order and recorded on magnetic tape. 

Procedure 

— 0 

♦ 

Each subject listened to the noise-masked words curing session 1 , and the 
unmasked words during session 2. The child was told that a list of words 
would be played (and, in the noise-masked condition, that the words were 
recorded in some noise). The subjects were instructed to repeat each item 
clearly immediately after hearing it. The test sequence was preceded by four 
practice trials. 

Results and Discussion 

Few words were missed by either the good readers (mean errors = J. 3) or 
the poor (mean errors - 2.0) in the unmasked condition. As we can see in the 
left-hand portion of Figure 1, whereas both groups made considerably more 
errors in the noise-masked condition, the poor readers (mean errors = 20.7) 
did markedly worse than the good readers (mean errors = 15.0* 

These effects were analyzed oy a two-way factorial analysis of variance. 
The between- groups factor, reading achievement, was significant, JF( 1 , 28) ■ 
17.6, 2 < * 001 » with g00d readers misreporting fewer words than poor readers. 
In addition, there was a significant main effect of noise, _F(1 ,28) = 687.4, J> 
< .001. From previous perception research with adults (e.g., Licklider & 
-toller, 1951), the detrimental effect of masking noise on intelligibility is 
well known. What is new, from our point of view, was the, finding that there 
were notable differences in the magnitude of the effect of noise on perception 
for the two reading groups. A significant interaction between the effect of 
masking^ and reading group was obtained, J?(1 ,28) = 15.8, 2 < • Vhen the 

stimuli were presented clearly in the unmasked condition, all the subjects 
reported the stimulus items accurately. The addition of^noise, however, made„ 
it significantly more difficult for the poor readers to perceive the stimuli 
than for the good readers to do so. Thus it seems that the speech perception 
skills of poor .readers are less effective than those of good readers but that 
this difference is observable only when they are required to respond to 
degraded stimuli. 
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figure 1. Performance of goo(J and poor readers on the speech perception task 
(Experiment 2) and the environmental sounds task (Experiment 3), 
plotted in mean percent correct* 
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Words of high and low frequency of occurrence were employed in the 
experiment as a means of examining whether differences between the groups in 
perceptibility of the items were attributable to differences in vocabulary 
skills. In Figure 2 we can see the performance of the two reading groups on 
the high and low f^&qucncy items. While the variable of word frequency had a 
large effect jYt^ne perceptibility of a word, _F(1 , 28) = 1?5.0, j) < > there 
wasr no r interaction between the word frequency variable and reading group, 
F(1,28) = .01?, p > .500. The poorer performance of the poor readers cannot, 
Therefore, be attributed to possible differences in word knowledge. Instead, 
it points to a problem in perception of speech. 

Thus far we have examined the results by viewing each response either as 
being totally correct or as an error. In order to determine where the 
perceptual mistakes were occurring, it is useful to examine the nature of the 
errors as was done by Shankweiler and Liberman (1972). Accordingly, each 
stimulus was broken into three segments: the initial cluster, the medial 
vowel and the final cluster. 'A. given error response couid deviate from the 
target stimulus at one, two, or all three word positions. The error data for 
this analysi - are summarized in Table 7. For both reading groups, the 
greatest number of errors occurred in .the initial portion of the word, the 
final position was second in error rate, and very few errors were made on the 
vowel in medial position This position effect was significant, _F(2,55) = 
1b?.2. j) < .001, with no difference in error pattern between the good and poor 
readers. The lack of an interaction between position effect and reading group 
suggests that tho basis for the error pattern was the same for both good and 
poor readers. We will briefly digress to consider what these factors might 
have been. 



Table 7 



Experiment 2: Speech- in-Noise : Error Location Within the Stimuli2 



Initial Medial Final 



Good 11.27 2.2 7.07 Mean number 

of errors on- 
Poor '14.67 3.7 8.93 48 trials 



2Error position not exclusive 
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Figure 2. Speech perception (Experiment 2): Mean percent correct on the 
noise-masked condition replotted as a function of word frequency. 
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The uneven distribution of errors across the three word positions . seems 
to correspond with the relative acoustic saliency of the segmen-ts. The vowel 
in acqpstic terms is more intense than consonants^ and is longer in duration. 
It is therefore not surprising to observe superior' identification of vowels on x 
' a listening task. Our . finding that the % initial consonant (or consonant 
cluster) is misheard more often „ than the final consonant (or consonant 

* - cluster) parallels research with CV and VC syllables (see Ohde & Sharf, t977, 

for a major paper ip this area; and Ohde & Sharf, 1981, and Pols & Schouten, 
1981, for recent discussions of those findings), and again seems to be related 
to the acoustic characteristics of £he segments. The results of research on 

• ' ' the speech cues suggest that the consonant in final position is more glearly 

represented in the acoustic signal than is the initial consonant. Syllable 
final formants have been observed to have transitions of greater duration - 
(except following the vowels /e/ and /if) (Lehiste & Peterson, 1961) and 
greater frequency change (Broad & Fertig, 1970) than have initial transitions. 

1 Further, the vowel nucleus of the syllable has been found to provide a variety 
of cues that may aid in identification of ' final segments, Peterson and 
Lehiste, 1960; observed vowel lengthening accompanying voiced final fricatives 
and voiced final consonants, and greater nasalization, of voVels preceding 
nasal consonants than for vowels following nasal consonants. - Thus final 
consonants may be easier to perceive because a greater amount of information 
specifies their identity. 

In view of the position effects obtained here, it seemed appropriate to 
examine the phonetic com-position of errors occurring in initial and final 
position. For both posritions, an adequate sampling was available to compare 
the relative frequencies of occurrence of errors on stop consonants and 
fricatives (see Table 8), but' not on n liquids or nasals. Accordingly, an 
analysis of variance was carried out o." the stop consonant errors and the .< 
fricative errors with error position, initial or final, specified. In this 
analysis our previous findings were again substantiated: good readers made 
' , fewer errors, F(1,28) = 10.0, j> s more errors occurred on initial 

position than on final, F(1,28) = 51.2, jd < - 001 » and there was no interaction 
between reading. groups and the position effect. A significant difference was 
obtained between the two phonetic categories examined. More stop consonants 
were missed than fricatives,.. F(1 ,28) = 51.1, j> < .001 and an interaction 
between reading group 'and phonetic category was obtained, _F(1 ,28) = 5.4, j> = 
.03. The poor readers missed the stop consonants significantly more often 
than, did the good readers. This could be taken as an. indication that poor 
readers have particular difficulty in processing stop consonant?. At the 
present, ,we are inclined to make the more conservative speculation that, with 
, the particular noise utilized, the stop information in the signals was 
relatively more obscured than was fricative information. Given that the 
* amplitude characteristics of the word were preserved, in the noise signal, an 
important cue for -fricative identity would also^ be preserved while place 
information for the stops yrould be less salienc. 
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Table 8, 



\ . Experiment 2\ ^ 

Speech- in-Noise: Analysis of Error Position and Phonetic Category 

» 

^Relative occurrence of errors of a phonetic - category: e.g. the stop conso- 
nants missed in initial position/ the stop consonants- that occurred in - the 
initial cluster. ... 

Initial Position Final Positfon 



* Stops Fricatives * -Stops Fricatives 

Good » 25.1 12.7 11.8 • 12.9 

Poor - 35.4 17.7 , 17.7* 13.4 



In sum,„w« found that on the unmasked condition the poor readers did as 
well as ^he good readers. When the perceptual system was stressed by the 
addition of noise, the poor readers made significantly more errors in 
perceiving the stimuli than did the good readers. With these results in hand, 
we may now consider the question whether the -difficulties the poor reader has 
yith reading may stem from a more general problem in auditory perception. If 
poor readers are generally inferior to good 'readers on another auditory 
perception task, where speech processing is not required,- a different in- 
terpretation of the nature of the poor readers' problem would be necessary 
than would be appropriate if the problem were specific to speech. 

EXPERIMENT 3: Perception of * Environmental Sounds Vith and Without 
^ Noise Masks ■ 

In the final experiment the subjects listened to a tape of environmental 
sounds: .first with the stimuli in noise, then in- quiet. 

Stimuli 

The stimuli for this experiment were selected and .edited frdm a^ magnetic 
tape recording of environmental sounds that had\*been obtained ftom the 
Neuropsychology Laboratory at the University of Victoria (Spree^n & 'Bonton, 
1969). The source tape had 26 sounds, two of which wez*e excluded for use here 
because they contained speech. 'The remaining 24 stimuli, listed in Table 9, 
included human nonspeech sounds (e.g., coughing)-, human activities (e.g., 
knocking on a door), mechanical sounds (e.g., machine-gun fire), animal noises 
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Experiment 3: Environmental Sounds Stimuli ' 



1 . Knocking on a door 

2. Water running from a faucet ~ i 
5 # * Organ - wedding march 

4* Phone ringing, 

5* Whistling % 

6. Airplane engine _ 

7. Door opening and closing^ 

8. Artillery - 

9. Car starting up and driving away. - 

-10. Dialing a phone ,^ 5 - < 

1 1 . Drum * . ^ 

12. Birds . 

1-3. Church bell- time 

14. Frogs and crickets ■ , m . , 

15» Piano 

16. v Dcg barking 

17* Trumpet fanfare « 

18. 'Train whistle ' % " 

1 9» Ca't meowing " " '< 

20. Clapping * - ^ 

21 . Coughing v 

22. Baby crying 

23. Thunder ; 

24- Typing v . # , 
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(e.g., frog croaks and cricket chirps), and sounds of nature (e.g., thunder). 
Each sound was digitized on the Haskins Laboratories DDP-224 PCM system and 
recorded on magnetic tape. One taped sequence, for the unmasked condition, 
contained the sounds presented in a fixed random order. In constructing the 
noise-masked sequence, it was not advantageous to use amplitude matched noise's 
as we had done in the case of the speech perception experiment, since the 
amplitude characteristics of the environmental sounds often provided strong 
cues to the identity of those sounds. We therefore chose instead to use a 
"broad band (0 to 10 kHz) white- noise signal as the masking stimulus. Pilot 
Vork suggested that a 0 dB -S/N ratio, as employed in the speech task, did not 
sufficiently mask the stimuli, but that a -2 dB S/N ratio would be appropri- 
ate. A second sequence for the noise-masked condition was recorded with each 
gound. masked by the white noise signal at the -2 dB S/N ratio. The stimuli 
for the two listening conditions were replayed at a comfortable listening 
level' of approximately 75 dE SPL. 

Procedure * 

Both the noise-masked and the unmasked stimuli were presented in a single 
session, with all subjects listening to the .noise-masked tape first. Prior to 
the testing the examiner explained that the child would hear two sets of 
sounds and that in the first set the items were recorded with noise. The 
child was asked 1to identify the source of each sound immediately after hearing 
it, providing as much detail as possible. Three practice trials were 
conduc ted without noise, to familiarize the subject with describing nonspeech 
sounds. 

Results and Discussion 

* ' v 
The, subjects' responses were compiled into a single list. Before 

scoring, all the responses ix> each sound were evaluated. A point system was 

devised ranging from 0 to 3» A score of zero was assigned if the response 

bore no relation to the stimulus; three was awarded if a fully specific 

identification had been provided. For the intermediate scores, a score of one 

was given if the response reflected the nature of the sound though wrong in 

detail (e.g., for coughing, if the S responded 'talking' or 'laughing' that 

person had correctly determined that a human vocal tfact was the source); two 

was assigned if the response was not inaccurate but somewhat unspecific (e.g., 

for an organ playing the, wedding march, the response 'music'). Response's 

distributed themselves somewhat unevenly: for some of the stimuli not 

four of the scoring categories were assigned. The scoring was reviewed by a 

colleague who did not know which responses came from good readers and which 

from poor ones. Discrepancies in numerical assignment by the two scorers 

occurred' for two responses and these were resolved by joint discussion of the 

two cases. The subjects' answer sheets were then scored and tabulated. The 

mean error score in the unmasked condition . was 6.7 for the poor readers an£ 

7.6^for the good readers (maximum =-72). In the noise-masked condition the 

mean error scores were 31 »4 for the poor readers and 36.9 for the good 

readers. Thesp performance levels 'are displayed 1 in the right-hand portion of 

.Figure 1 . 

As .in the speech perception experiment,' fev errors were made by ei*ther 
\reading group in the unmasked condition. With the addition of masking noise, 



performance for both groups was markedly reduced. The analysis of variance 
revealed a main effect of noise, J[(1 ,28) = 510.9, P < -001, and a main effect 
of reading group, F(1,28) - 4.7, _p = .04. We note that the poor readers 
performed better than the good readers on the nonspeech task. However, if age 
and IQ are controlled, the difference did not reach significance, 
P(1,26) = 3.6, p = .071.2 Given the equality of the performance of the poor 
"readers with thai of the good readers on this nonspeech auditory task, we can 
rule out inattention as .the explanation for their inferior performance on the 
noise-masked speech perception task. The results of this control experiment 
further suggest that the difficulty the poor readers manifested in perceiving 
speech in noise is not the , consequence of generally deficient auditory 
perceptual ability, but rather is related specifically to the processing 
requirements for speech. 



Earlier work has demonstrated that children who are poor readers have 
short-term memory deficits in situations where the stimuli lend themselves -to 
phonetic coding. The present experiments were intended to investigate the 
basis of this deficit, by asking whether the language processing problems of 
poor readers may extend to the area of phonetic perception. Third-grade 
school children selected*: for reading ability were first tested' on serial 
recall of word strings, a task that previously had been found to differentiate 
good and poor readers (Mann et al., 1980). As before, the poor readers made 
more errors than the good readers. The results -are consistent with the 
'hypothesis (Liberman et'al., 1977; Shankweiler et al., 1979) that a failure to 
use phonetic coding efficiently l'eads to the poor reader's deficiency in 
short-term memory for labelable stimuli. 

In -order to investigate the origin of this memory coding problem, the 
subjects were further tested on two tasks. One of these employed spoken words 
and the other, nonspeech environmental sounds. Each task was presented under 
two conditions: one with a favorable signal- to- noise ratio and one with 
masking noise. The results indicated a deficit* for the poor reader group that 
was specific to speech stimuli and occurred only in the noise-masked condi- 
tion. Significantly more errors were made by the poor readers than the good 
readers when listening to speech in noise; the groups did not differ, however, 
in the perception of nonspeech environmental sounds, whether noise-masked or 
not. This pattern of results suggests that the poor readers could process the 
speech signal adequately, as expected, but they required a higher quality 
signal for error-free performance than the good readers. The absence of 
differences between the reading groups on the control experiment with environ- 
mental sounds suggests that the poor readers' problem is not- manifest on just 
any auditory task in which the stimuli are noisy, but is instead more 
selective. The joint outcome of these perception studies suggests that poor 
readers require more complete stimulus information than good readers in order 
to apprehend the phonetic shape of spoken words. 

The present experiment has demonstrated associated deficits on the same 
group of poor readers: inferior performance on serial recall and inferior 
performance on a stringent test of speech perception. We now turn to consider 
how these two deficits might be related. First, we have noted that poor' 
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readers show weak effects of phonetic similarity in recall tasks, a fact that 
has been taken as evidence that they make inefficient use of phonetic coding 
in short->term memory. In the memory experiment of the present study, the 
analysis of the error responses provides direct evidence that the poor readers 
were using a phonetic code to retain material in short-term memory, though, of 
course, less effectively than the good readers. The errors that occurred were 
rarely semantically related to the target items, which might have indicated 
use of an alternative coding strategy; instead, they consisted, of transposi- 
tions of phonetic segments from adjacent syllables. Such an error pattern as 
we obtained seems possible only if the subjects were indeed using a phonetic 
coding strategy. Thus, it is apparent that whereas both good and poor readers 
•were phonetically coding the stimuli,' the poor readers were more apt to 
exchange segments across word boundaries and they experienced greater diffi- 
culty in retaining the crder of words within each word string. 

Thus the suggestion that poor readers have" greater difficulty in correct- 
ly retaining phonetic representations is corroborated by the pattern of their 
errors on the serial recall task. In the word perception task, we obtained 
evidence that poor readers also experience greater difficulty perceiving the 
phonetic form. On the contrary, analysis of errors in word perception showed 
that good and poor readers did not differ in the effect of word frequency on 
item identifiability. Therefore, the greater susceptibility of the poor 
readers io errors of identification apparently does not arise from differences 
between good and poor readers in vocabulary level. The problem thus appears 
to be not in dealing with the linguistic content of the stimulus items, bu+ 
rather with the form. In perception as well as in recall of linguistic items, 
the poor readers' problems would seem to stem from failure to adequately 
internalize certain formal properties of language: in these instances, 
properties relating to the phonetic pattern. 

We may speculate therefore that the problems of 1 poor readers, evident on 
both the memory task and the perceptual task, arise, at least in part from a 
common cause. In this connection, it may be relevant to recall the finding by 
Rabbitt ( 1 968 ) , to which we have referred, in which there was shown to be a 
relationship between recall performance and the stimulus factors that affect 
perceptual clarity. When adult subjects were asked to recall strings of 
digits, recall of items presented without noise was impeded if subsequent 
items were presented in noise. Thus, making some items difficult to perceive 
seems to reduce ability to rehearse the non-noisy items of the -string also. 
We may speculate, by extension, that poor readers' recall suffers in part from 
the difficulties they incur in perceptual processing. 

Thu3 one may surmise from our results that the recall performance of poor 
readers for words presented auditorily suffers as a result of faulty phonetic 
coding of the stimuli. Moreover, we suppose that this difficulty may arise 
whenever a phonetic representation is formed irrespective of the sensory 
modality of the signal. We base this conjecture on the outcome of earlier 
findings (Liberman et al . , 1977; Shankweiler et al., 1979) which have shown 
that the failure of poor readers to make full use of phonetic coding in recall 
occurs both with auditory presentation and with visual presentation of the 
stimulus items. These parallel findings for presentation of stimuli by ear or 
by eye led us to suppose that poor readers' problems in memory coding are of a 
linguistic nature. 




It is noteworthy that other investigators who have employed similar 
criteria for subject selection, but who have used very different experimental 
approaches to that adopted in the present study, have reached a similar 
conclusion. Using the memory scan procedure of Sternberg (1966), Katz and 
Wicklund ( 1 97 1 ) have found slower .encoding times for poor readers than for 
good readers with visually- presented word-strings. If we are correct in 
supposing that the.memory deficit in poor readers at least in part has its 
origin in phonetic perception,, it should be possibLe to . demonstrate differ- 
ences in a variety of situations in the facility and accuracy with which good 
and poor readers process linguistically codable material that is presented 
either visually or auditorily. 

REFERENCE NOTES 

1. Szubowicz, L. tutorial guide to Wendy : The Haskins wave editing and 
display system . Haskins Laboratories, 1 977 ♦ 
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FOOTNOTES 

1 In word final position the fricative and affricate set was slightly 
different, consisting of /f/, /s/, /t}/, /J/, /©/ and /z/. 

2 In Experiments 1 and 2, the data were likewise reanalyzed controlling 
for age and IQ. In these experiments, the significance of the differences 
between reading groups was not reduced when age and IQ were controlled. 
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THE USE OF ORTHOGRAPHIC STRUCTURE BY DEAF ADULTS: RECOGNITION OF FINGER' 
SPELLED LETTERS 

Vicki L. Hanson 



Abstract , Deaf adults' knowledge of English word structure was 
tested in a^task requiring letter v report for f ingerspelled words, 
pseudowords, and nonwords. Deaf subjects, like hearing subjects, 
were sensitive to orthographic structure as indicated by accuracy of 
letter report: Letters of words were reported most accurately, 
while letters of pseudowords were reported more accurately than 
letters of nonwords. Analysis of the incorrect letter reports for 
correctly recognized words revealed that deaf subjects tended to 
produce orthographically regular responses. However, in contrast to 
the reports of hearing subjects, the responses of deaf subjects did 
not tend to be phonetically^, consistent with the presented word. 
These results provide clear evidence that deaf adults are able to 
abstract principles of English orthography, although the phonetical- 
ly inconsistent letter reports suggest that the spelling process for 
deaf persons may be fundamentally different from that for hearing 
persons. 

The present research examines the use of orthographic structure by 
prelingually and profoundly deaf adults. The orthography of English reflects 
the phonological structure of the spoken language. As a result, segments of 



*An earlier version of this paper was presented ^at the meeting of the American 
Psychological Association, Los Angeles, August, 1981. 
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the written language map onto segments of the spoken language. The question 
here is whether deaf adults, in the absence of normal speech input, are able 
to abstract the regularities of English orthographic structure. 

Ability to use the regularities of the orthography is an important 
component both in word recognition and in spelling. Research on word 
recognition with normally-hearing adults has found that there is an advantage 
in letter recognition for orthographically regular nonsense words (pseudo- 
words) over orthographically irregular nonwords (Aderman & Smith t 1971; Baron 
& Thurston, 1973; Carr, Davidspn, & Hawkins, 1978) and an advantage in letter 
recall for these regular over irregular nonsense words (Gibson, Pick, Osser, & 
Hammond, 1962). 

In -spelling, the ability to access and exploit the orthographic regulari- 
ties of English is a factor determining spelling success. While accurate 
spelling of words can result from rote memorization or from visual recognition 
of the correct spelling from a collection of possible spellings (Simon & 
,Simon, 1973; Tenney, 1980), these strategies* ignore the systematic aspects of 
English orthography (Chomsky, 1970; Klima, 1972; Venezky, 1970). Recent work 
by Fischer (1980) has shown that good spellers have greater ability to exploit 
these regularities of the orthography than do poor spellers. 

To date, little work has been concerned with the question of use of 
.orthographic structure by deaf individuals. One study that has been directed 
at this issue is that of Gibson, Shurcliff, and Yonas (1970). Testing for 
recall of tachistiscopically presented pseudowords and nonwords, they found 
.that deaf adults, like hearing adults, correctly recalled more of the 
orthographically regular than of the orthographically irregular letter 
strings. Similar findings were obtained by Doehring & Rosenstein (1960) in an 
experiment with deaf children (ages 9-16 years). They found better recall of 
CVC trigrams (pseudowords) [than of CCC trigrams (nonwords). These findings 
led Gibson et al. (1970) to conclude that "The redundancy contributed by 
invariant mapping of speech sounds may well make it easier for the hearing 
child to pick up the common spelling patterns an' regularities as he learns to 
read, but clearly it can be done without this" (p. 71). 

The present research examined their conclusion. The ability of deaf 
adults to use orthographic structure in word recognition and in reporting the 
letters of words was tested. Deaf subjects in this research were all 
congenitally and profoundly deaf adults. These persons are unable to acquire 
knowledge of speech by normal meansl -Since the orthography of English 
reflects the structure of the spoken language, these deaf adults may be 
ex pec tea to be less able than hearing adults to acquire knowledge of this 
structure and to use it. If, however, as suggested by Gibson et al. (1970), 
ability to acquire -knowledge of orthographic structure does not depend oh 
availability of normal speech input, then deaf adults may still be able to 
acquire this knowledge. To investigate whether these deaf adults differ from 
hearing adults in the use of orthographic structure, the performance of a 
group of normally-hearing subjects was compared with that of deaf subjects. 

The use of orthographic structure was investigated testing recognition 
and recall of fingerspelled letter strings. Fingerspelling is a manual 
communication system based on English in which words are spelled out by the 
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sequential production of letters of the manjaal alphabet. As shown itf Figure 
1, the American manual alphabet has a handshape for each letter of ttfe English 
alphabet* Fingerspelling is used both in American -Sign Language (ASL or 
Ameslan) and in manual communication systems based on English. 

In fingerspelling, words are presented as a temporally sequential display 
of individually produced letters with an average presentation rate of 20 msec 
per letter (Bornstein, 1965). Letters are displayed with the hand held in one 
spatial location. For printed letters, display characteristics such as this 
make word recognition difficult. With sequential presentation of printed 
letters displayed in one spatial location, normally-heaping readers can 
accurately name words only when the duration of each letter is at least 375 
msec (Kohlers & Katzmar., 1966). Even when the printed letters are spatially 
distinct, ability to read words is dramatically reduced for sequentially 
displayed individual letters compared with multi-letler displays (Newman, 
1966). Fingerspelling' provides an interesting case in word recognition in 
that fingerspelled words can be recognized at rates that are difficult for the 
recognition of sequentially presented printed letters. For this reason, a 
secondary goal of the present research, was to examine skilled reading of 
fingerspelling. 

A sequential presentation of letters might suggest sequential recognition 
of individual letters. However, it may be that, similar to the recognition of 
printed words, orthographic structure is* used in the recognition of 
fingerspelled words. Since it has been demonstrated that there are "co- 
articulatory" effects in skilled fingerspelling, with letter context 
influencing letter production (Reich, 197*0, this could allow for the use of 
orthographic structure in word recognition. 

In the present experiment, fingerspelled words, pseudowords, and nonwords 
were presented to deaf and hearing adults skilled in the use of fingerspel- 
ling. If orthographic structure is used in processing the fingerspelled 
stimuli, then letters of orthographically regular nonsense word3 should be 
recalled more accurately than letters of orthographically irregular nonsense 
words. Errors in letter report for words were analyzed to 'examine orthograph- 
ic regularities in production for both deaf and hearing subjects. 



METHOD 

* ' Stimuli 

Sixty stimulus items were used. Thirty were real^ words chosen from 

samples— of- words- found-misspelled in writing-by dea"r"adurts/ These words . 

ranged in length from five to thirteen letter?, mean length being 8.3 letters 
per word. The words ranged in frequency of occurrence from 1-190 (median of 
10.5) according to Kucera and Francis (1967). These thirty words were matched 
in mean length' with 20 orthographically regular pseudowords (e.g. , BRANDIGAN, 
MUNGRATS, VISTARMS) and 10 orthographically irregular nonwords (e.g., FTER- 
NAPS, PKAN7, VETMFTERN) . .The selection criteria for the orthographically 
regular and irregular words were in accord with the criteria outlined in 
Appendix A of Massaro, Venezky, and Taylor (1979). According to these 
criteria, the regular strings (pseudowords) were pronounceable and had ortho- 
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graphically legal spelling patterns. The irregular strings (nonwords) con- 
tained unpronounceable consonant clusters. A complete listing of the stimuli 
is given here in the Appendix. 

. .Stimuli were recorded on videotape by a deaf native signer of ASL (i.e., 
a person who had deaf parents and had learned ASL as a first language) . The 
signer made no mouth movements nor facial expressions that would indicate the 
lexical status of items. Measurement of the length of each recorded item 
revealed a mean presentation rate of 354 letters per minute. This rate is 
consistent with the rate found by Bornstein (1965) to be a natural ASL rate. 
The production rate for the thirty words did not differ from the production 
raxe for the other thirty items, t(58)=1.87, £>.05. Words, pseudowords, and 
nonwords were mixed throughout the list. Following each item, a blank 
interval of approximately. 10 seconds was recorded for use as a response 
period. 

Procedure 

Subjects were instructed that they would see many f ingerspelled items and 
that for each they were to make two responses: First, write the letters of 
the item they had just seen; second, make a lexical decision. They were to 
cirole u YES or NO on, their answer sheet to indicate whether they thought the 
presented letter string was or was not an actual word. The instructions, 
signed in ASL, were recorded on videotape. 

Subjects were run in groups of one to six persons. The entire experiment 
lasted approximately 30 minutes. 

Subjects 

A group of deaf subjects and a group of hearing subjects were tested. 
Subjects in both groups had deaf parents and had learned f ingerspelling from 
their parents. 

Deaf subjects were 14 congenitally deaf .adults recruited through New York 
University and California State University, Northridge. All were profoundly 
deaf. There were six women and eight men, ranging in age from 17 - 53 years, 
median age 28.5. years. 

Hearing subjects were recruited through interpreter services in Connecti- 
cut and New York. There were five women and three men ranging in age from 
22-49 years, median age 29 years. 

RESULTS AND DISCUSSION 

To examine possible processing differences for the two groups, the eight 
hearing subjects were matched in overall accuracy with eight deaf subjects. 
Overall accuracy was determined for each subject as the percentage of correct 
responses across conditions. Only items for which there was both a correct 
lexical decision and a correct report of all letters were considered to be 
correct responses. Overall, hearing subjects had an accuracy rate of 43.7* 
(range 21.7* - 65.0*). Eight deaf subjects, whose accuracy was in the middle 
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of the performance range for the 14 who participated, performed at a 
comparable level. Overall they were MO.8% accurate ( range 20.0? -70.0%), 
which was not significantly different from the accuracy of the hearing 
subjects, UlMJs.S 1 *, £>.05. Further analyses are based on these two matched 
groups of .eight subjects each-. 

Lexical Identification 

Mean overall accuracy for the lexical decision task was 85.5%. Analysis 
of .group (deaf, hearing) by stimulus type (words, pseudowords, nonwords) found 
that there was no significant difference in accuracy across stimulus type , 
F(2,28)=.88, MS e =70.06, n>*05 f nor was there an interaction between, group and 
stimulus type, F(2 ,28)= *H , MS e= 70.06, £>.05. There was a tendency for deaf 
subjects to perform this task more accurately than hearing subjects, although 
the difference only approached significance, F( 1 , 1U)=3 .00 , MS e =339.35, j)<.20. 
*The performance of both groups of subjects in this task is shown in Table 1. 



, Table 1 

« 

Mean percentage correct lexical decisions and correct identification of words. 



Deaf Hearing 

Lexical Decisions 

Woi js 91.7% 80. 4% 

Pseudowords 89.'*% 84. U% 

Nonwords 88.8% 77.5% 

Word Identification! 
correct lexical decision 



93.0% 92.6% 



To ensure that this high accuracy could not have been due to some non- 
linguistic cue to wordness of the stimulus items (e.g., facial cues or 
"awkward 11 production of pseudowords and nonwords), eight hearing adults, naive 
with respect to fingerspelling, were asked to make lexical decisions regarding 
the stimuli. They viewed the videotape and were told that for every 
fingerspelled item they were to circle YES or NO on their answer sheet to 
indicate whether or not they thought the item was an actual word. This group 
of naive adults was only M9.2% accurate in the task, a rate that does not 
differ from chance performance, x 2 (D=»05, j>>.05. Therefore, the high accura- 
cy of the two groups of deaf and hearing subjects in this task can be 
attributed to their knowledge of fingerspelling. 
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The whole report technique of the present experiment allowed for a 
determination as 'to whether or not there was correct identification of words. 
Three types of response errors were considered to be 'failures to identify the 
word. First were those responses on which more than 50% of the letters were 
omitted. These omissions were rare; only three such errors were made (by deaf 
subjects) . The second source of error consisted of responding with a 
morphologically incorrect formof'the word Ke.g., baptized for BAPTIZE) and 
accounted for three errors of the deaf subjects and five errors of the hearing 
subjects. The third source of error consisted -of responding with the wrong 
word (e.,g., complicate for COMMUNICATE), accounting for five errors'of the 
deaf subjects and nine errors of the hearing subjects. Table 1 presents 
subjects' accuracy of word Identification given a, correct lexical decision. 
Deaf and hearing subjects did not differ in their accuracy on word .identifica- 
tion , t(14)=.09, p>.05. 

These latter two sources of error in 'lexical identification appear, to 
result from guessing the word on the basis of a few letters. It should be 
pointed out that th\§ strategy of identifying a word on the basis of a few 
perceived letiSers "i3> not a bad one in normal conversations. In these 
conversations, letters of f ingerspelled words are often omitted or sloppily 
produced CCaccamise, Hatfield, & Brewer, 1978), but within the syntactic and 
semantic context provided by the conversation, word identification from 
r partial information is possible. In the present task, however, recognition of 
only a few letters led to the errors in lexical identification. These errors 
resulted both in incorrectly identifying actual, words and in incorrectly 
responding that pseudowords and nonwords were words (e.g., raps , for RAPAS and 
veteran for VETMFTERN) . 

7 — 

Letter Report Accuracy 

Given a correct response on the lexical decision taak and a correct 
lexical identification of the words, how accurate were subjects at reporting 
all the letters of an item? A summary of this performance by the two groups 
on each word type is shown in Table 2. An analysis of the percentage correct 
was performed on group (deaf or hearing) by word type (word, pseudoword, 
nonword) for trials on which there was a correct .exical decision and 
identification. The analysis revealed a strong effect of word type, 
F(2,28)=170;03, MS e=12 q 32, p<.001. This difference was significant between 
all word 'types (Newman-Keuls, £ < .01 ) , thus indicating effects of word famili- 
arity (letters of words better recalled than letters of pseudowords) and 
orthographic structure (letters of pseudowords better recalled than letters of 
nonwords). There was no main effect of group for accuracy of letter report, 
F(1 ( lDs1.65, M§ e =368.33, p>.05, but there, was an interaction of group by word 
type, F(2,28)--6.70, MS e =129.32, p<.005. Analysis of the simple effects 
revealed that the two subject groups differed in letter report accuracy for 
words, F(1,28)=13.93. £<.001, but did not differ significantly in letter 
report accuracy for pseudowords, F(1,28)=..00, p>.05, or nonwords, F-( 1 ,28)= .29 , 
2>.05. Thus, the interaction of group by word type was due to greater 
accuracy by hearing than deaf subjects on letter report for words. 
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* Table 2 

Mean percentage correct report of all letters given a correct identification 
of words and a correct lexical decision fof the pseudowords and nonwords. 



Words 
♦ Pseudowords 
Nonwords 



Deaf 
70. 2% 
30. 7% 
9.3% 



Hearing 
92. 3% 

31. n 

6. OX 



If "subjects were using orthographic structure in the processing of words 
and pseudowords, there should be honindependence of letter report for these 
stimuli. That is* the probability of letter report of a given letter should 
be a function of * the probability of the recall of the" other letters, in the 
word. This interdependence of letter report' would not be expected to be 
involved in letter report for nonwords, however, since principles of English 
orthography are violated in these nonwords. Tests for independence of letter 
report were * performed separately on- words, pseudowords, and nonwords. 
Independence is indicated if the following equation holds: 

n 

p(all- letters of an item) = >p( individual letter) (1) 

where n=number of letterS^fcn the word. 

"Analyzing for group (deaf or hearing) by word length by probability .(all 
letters vs. individual letters), it'was found that for 'word* and , pseudowoVds 
the probability of correctly reporting all the letters of the item was greater 
than the probability of reporting the letters independently: - for words,' 
£(1,1*0=71.71, MS e =263.J40, £ <.Q01; for pseudowords, F(1., 14)s26.95 v MS e =285.02, 
jK.001. The effect did not interact; with subject population for either the 
analysis of words, F( 1 , 14)= .22, • MS e =263.40, £>.05, or pseudowords, 
F( 1,1*0=. 81, MS e= 285.02, £>.05. Thus*, the letters of words and pseudowords 
were 'not, processed independently. However, for the f ingerspelled i£ems that 
violated orthographic structure "(the nonwords), the probability of correctly 
reporting all the letters of the item was not greater than, the probability of 
independently reporting each letter, F(1 ,1*0 = 1 .98, MS e =103.43, £>.05. For 
nonwords, therefore, the letters were processed independently. As before, 
there was no interaction, with subject group, £(1,1^5 = 1.35, MSe=103.*43, j>>.05,. 

These results give evidence for the ability , of . deaf adults to use 
orthographic structure. Similar to the orthographic structure effects previ- 
ously reported for deaf adults by Gibson et al., ( 1970) , the present study 
found greater accuracy in letter report for pseudowords than nonwords. In 
accord With these 'findings, the nonindepetfd^nce of letter processing for words 
and pseudowords indicates interdependence of letter processing. That is, 
processing of a given letter was influenced by other letters of the word or 
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pseudoword . There Were also other indications that deaf and hearing subjects 
in the present experiment were aware of violations of English orthography: 
When , .the fing*P#pelled nonwords were presented, subjects often laughed. 
Generally a look of surprise would appear. on their faces at these violations 
of orthographic structure. -n . 

Together, the above results also clearly indicate that orthographic 

struct ure is- used in processing finger sp elli ng. They Indica te that even 

though letter presentation is temporally sequential, letter processing is 
influenced by surrounding letters. Since there are coarticulatory effects in 
skilled fingersp'elling, it is reasonable to assume that a fingerspelled letter 
contains information about? adjacent letters. A skilled fingersjjeller wo.uld, 
therefore, be expected to make use of this context information in word 
recognition (see Wickelgren [1969, 1976] for a discussion of context-sensitive 
'coding in speech), rttis context-sensitivity may explain how orthographic 
structure is'able to boused in identifying fingerspelled letters despite the 
temporally sequential display of letters^ and may explain how these sequential 
letters can be processed so much more rapidly, than sequentially presented 
printed letter s. r 

In addition to the effects of orthographic structure, word familiarity 
effects were found here. These word familiarity effects, involving better 
recall of letters of words than letters of pseudowords, are consistent with 
the greater accuracy of letter report for letters of printed words than for 
letters of printed pseudowords (Manelis, 197^; Spoehr 4 Smith, 1975). A word 
superiority effect of fingerspelled words over fingerspelled nonwords, consis- 
tent with the present findings, has been reported earlier by Zakia and Haher 
(1971). 

Some theorists attribute the word familiarity effect to the fact that 
words allow for holistic recognition of visual configurations. However, it is 
unlikely that this interpretation can account for the present results for the 
following reason: The majority of stimulus words would rarely, if ever, have 
been seen- as fingerspelled words by the subjects prior to this experiment 
because" the words wouJ.d tend to be signed rather than fingerspelled in signed 
conversations. The familiarity that the subjects have with these words, 
therefore, would be with the printed form of the word. This situation is 
analogous, perhaps, to that of the recognition of mixed-Zcase printed words in 
that the orthographic integrity of the words is preserved, but the visual 
configuration is disrupted. Studies have found the while there is a perceptu- 
al advantage for same-case over mixed-case words, word, familiarity effects are 
obtained with mixed-cases, indicating that > the word familiarity effect need 
not be totally attributable to holisti-c word recognition (Coltheart & Freeman, 
1974; McClelland, 1976); What, then, contributes to superior letter report 
for words in the present experiment? 

Two factors appear to be involved. The first is that the associations 
between letter sequences of words should be stronger than the associations for 
the sequences of permissible although novel items. These stronger associa- 
tions would allow more perceptual facilitation of letters for words than 
pseudowords (Adans, 1979) ♦ Thus, the letters would be more accurately 
recognized for words than for pseudowords. The* second factor contributing to 
the word familiarity effect is one of memorability. Pseudowords and nonwords 
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represent novel letter sequences. The subjects must recall the letters based 
on a single presentation* But for words, once the word is recognized, the 
subjects are able to bring their productive spelling abilities to bear on the 
task of letter report* Incorrect letter reports for these words, in this 
respect, represent spelling errors. 

Incorrect Letter Reports for Words 

Each incorrect letter report for a correctly identified word was scored 
in three way!: (1) each was classified as to whetner or not the reported 
letter string produced a sequence that preserved the phonetic structure of the 
presented word. (2) each was classified as to whether the reported sequence 
was orthographically regular or irregular, and (3) each was classified as to 
the type of error. 

For hearing adults and children, the predominant form of spelling error 
is a phonetically consistent but orthographically incorrect rendering of the 
intended word (Fischer. 1980; Frith. 1980; Masters. 1927).. In these misspel- 
lings, each phonetic segment of the word is t grap temically represented in the 
order of occurrence. The phonetic structure is therefore maintained in the 
misspelling. Did the incorrect responses for words in the present' experiment 
preserve the phonetic structure of -the words presented? Analysis revealed 
that the hearing subjects made more incorrect letter reports phonetically 
equivalent to the target word than did the deaf subjects, x W =10.01 f *p<. 005. 
For hearing subjects, the mean percentage of such responses was 63. o%; for 
deaf subjects, the corresponding percentage was only 18.6%. 

But while the letter reports for deaf subjects were not consistent with 
the phonetic structure of the target word, by and large, the responses were 
■orthographically regular. Orthographically regular words, in accord with 
Massaro et al. (1979), were both pronounceable and contained only legal 
consonant and vowel clusters. For deaf subjects, 93-8% of the \incorrect 
responses were regular English letter sequences. For hearing subjects, 95-8% 
were regular. There was no difference in the frequency of deaf and hearing 
subjects making such responses, x 2 (l)=-62, £>.05. This indicates that, like 
hearing adults, deaf adults have a definite knowledge of English orthographic 
constraints. 

The incorrect letter reports were further classified using the following 
categories of error type: letter deletions, substitutions, insertions, and 
transpositions. As shown in Figure 2, a major difference in error type for 
deaf and hearing subjects was the tendency for deaf, but not hearing, subjects 
to order the letters of words incorrectly, resulting in a transposition of 
phonetic segments. Some examples are adveristement for ADVERTISEMENT, funreal 
for FUNERAL, hemsiphere for HEMISPHERE, viedo for VIDEO, and vechile for 
VEHICLE. While deaf Subjects made 17 errors of this type (representing 23.9% 
of their total errors), only : 1 such error was made by hearing subjects (7*7* 
of the total). For hearing persons, the incidence of letter transpositions is 
generally so low that it may be possible to account for all the errors in 
spelling experiments without even including a category for letter transposi- 
tions (Fischer, 1980). It is interesting to note that none of these 
transpositions preserved the phonetic structure of the words. In all cases 
the transposed letters incorrectly ordered phonetic segments. 
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Little work has been undertaken to understand the spelling process for 
deaf persons/ The present finding of the low percentafge of letter reports 
phonetically equivalent to the target is of great interest as it suggests that 
the cognitive processes underlying Spelling for deaf adults may be fundamen- 
tally different from those underlying productive spelling for hearing adults. 
Models of. the spelling process for hearing persons commonly hypothesize that 
productive spelling involves generating a phonetic representation of the 
target word and then generating possible orthographic realizations of this 
representation (Frith, 1980; Simon & Simon, 1973). These models therefore 
account for the tendency of hearing persons to make misspellings that preserve 
the phonetic structure of the intended word. 

The low incidence of phonetically consistent letter reports by the deaf 
adults in this experiment suggests that the spelling process of deaf persons 
is not well described by the'Sse models. ' The few studies that have been 
concerned with the spelling process for deaf persons have been conducted with 
deaf children. Dodd (1980) examined the spelling of words by orally-trained 
deaf children in England.; The task in Dodd's experiment was to lipread words 
pronounced by 'the experimenter and then spell the words. The children (mean 
age 1*4.5 years) made orrly abort 11% misspellings that were classified as 
reflecting the phonetic structure of the pronounced words. It should be 
noted, however, that 64. 8% of the children's errors were classified as 
"refusals" to spell the pronounced word. If only the actual misspellings of 
the children are considered in Dodd's data, the incidence of phonetically 
consistent misspellings is 31.5%. 

In another experiment designed to determine the underlying spelling 
processes of deaf children, Hoemann, Andrews, Florian, Hoemann, and Jansem^ 
(1976) found that few of the misspellings of the deaf and hearing-impaired 
children they tested could be considered phonetically equivalent to. the target 
word. Children in their experiment were ages 6-19 years and were being 
educated with the Rochester Method (combined speech and f ingerspelling) . The 
children were presented with pictures of objects and were asked to spell the 
name of each of the objects. Earlier work had found that the majority of 
misspellings made by hearing children on this task were phonetically consis- 
tent with the target word (Mendenhall, 1930). No more than 19% of the 
misspellings of the children tested by Hoemann et al. could be considered 
phonetically equivalent to the target. 

Cromer (1980) analyzed samples of free writing from six orally-educated 
deaf children in England (median age 10.5 years). By Cromer's analysis, 
62.25% of the misspellings of the deaf children were "phono-graphic errors," 
defined as resembling "in some respect the sound of the target word when 
pronounced" (p. 412). By this definition, errors such as basking for "basket" 
and amanals for "animals" were scored as phono-graphic errors. Thus, not all 
these phono-graphic errors would be phonetically consistent with the target. 
Examining Cromer 1 s corpus of errors v according to the present concern of 
phonetic misspellings, it \s apparent that only few of the misspellings can be 
considered to be consistent with the phonetic structure of the intended word. 

One final point is worth mentioning. The deaf children studied by Cromer 
(1980) made transpositions similar to those made by the adults in the present 
study. Cromer classified these transpositions and ordering errors under the 
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category "Visual Errors." While this type of error accounted for 15.75% of the 
errors made by the deaf children in his study, no such errors were made by the 
normally-hearing control children. As with the present findings, these 
ordering errors did not preserve the phonetic structure of the target word. 

Together, these studies with children and the present one with adults arje 
consistent, a consistency that is especially striking given the differences in 
methodology of these studies and the differences in the age and language 
background of the subjects. These studies converge on the findi g that deaf 
persons do not make phonetically consistent spelling errors to th.- degree that 
hearing persons do. The suggestion from these findings is that the spelling 
process for deaf persons may be fundamentally different from the spelling 
process for hearing persons. In comparison with hearing persons, it appears 
that deaf persons > may make less use of a stored phonetic representation of 
words when spelling. The types of errors they make appear to be consistent 
with what Ellis (in press) terms as errors based on "partial lexical 
knowledge" in which the speller knows "some but not" all of the letters, or all 
of the letters but not the correct order." While the errors of deaf adults in 
the present experiment were often consistent with this definition, it should 
be borne in mind that deaf signers may have additional strategies available to 
them. 

One strategy of deaf -signers in the present experiment deserves mention: 
'Very often the deaf subjects f ingerspelled items on their hands before writing 
their responses. This f ingerspelling often allowed them to try different 
spellings in an effort to decide the correct letter sequence. This "trying 
out" spellings should not be thought of as equivalent to the strategy of 
writing down various spellings of a word to determine which "looks" correct. 
The deaf subjects occasionally employed this strategy also. Rather, the 
manual strategy seems more to determine which spelling "feels" correct. Often 
subjects did not even look at their hands while trying out the letter 
sequences. Often the hand they were not using for writing their answers was 
held under the table as they f ingerspelled tie different sequences. This 
suggests, therefore, that whereas a component of the spelling process for 
hearing persons is phonetic , a component of the spelling process for deaf 
sife." ,ar s may be kinesthetic . 

CONCLUSIONS 

The present experiment clearly indicates that deaf adults are able to 
make use of orthographic structure. This was shown both in the recall 
advantage for orthographically regular over orthographically irregjlar letter 
strings, similar to the findings of Gibson et al. (1970), and in the analysis 
of errors in letter reports for words. These results support the conclusion 
of Gibson et al. that while the mapping of speech sounds to graphemes may 
facilitate the acquisition of orthographic structure for hearing persons, 
cbngenitally and profoundly deaf persons are, nevertheless, able to acquire 
this knowledge. 

The analysis of letter report errors for words indicates that the deaf 
adults were sensitive to the orthographic regularities of English in their 
productions. But in contrast to the hearing adults, the responses of deaf 



adults were not consistent with the phonetic structure of words. These 
results suggest that deaf adults may use orthographic but not phonetic 
structure when spelling. 

A secondary goal of the present research was to examine recognition of 
fingerspelled words. The work suggests' that signers, both deaf and hearing, 
make use of orthographic structure in the processing of fingerspelling. 
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Appendix 



Words 



Pseudowords 



ADVERTISEMENT 

AWKWARDLY 

BANKRUPTCY 

BAPTIZE 

CADILLAC 

CAREFUL 

CHIMNEY 

COMMUNICATE 

ELABORATE 

FUNERAL 

GRADUATE 

HELICOPTER 

HEMISPHERE' 

INTERRUPT 

MOUNTAIN 

PANTOMIME 

PHILADELPHIA 

PHYSICS 

PREGNANT 

PSYCHOLOGICAL 

PUMPKIN 

RHYTHM 

SUBMARINE 

SURGERY 

THIRD 

TOMATO 

-UMBRELLA 

VEHICLE 

VIDEO 

VINEGAR 



BRANDIGAN 

CADERMELTON 

CHIGGETH 

COSMERTRAN 

EAGLUMATE - 

FREZNIK 

FRUMHENSER 

HANNERBAD 

INVENCHIP 

MUNGRATS 

PHALTERNOPE 

PILTERN 

PINCKMORE 

PRECKUM 

RAPAS 

SNERGLIN 

STILCHUNING 

SWITZEL 

VALETOR 

VISTARMS 

Nonwords 

CONKZMER 

ENGKSTERN 

FTERNAPS 

HSPERACH 

PGANTERLH 

PIGTLANING 

PKANT 

RANGKPES 

RICGH 

VETMFTERN 



42 



48 



EXPLORING THE INFORMATION SUPPORT FOR SPEECH* 
J, A. Scott Kelso+ and Betty Tuller*+ 



Abstract . A well-established feature of speech production is- that 
talkers, faced with both anticipated and unanticipated perturba- 
tions, can spontaneously adjust the movement patterns of articula- 
tors such that the acoustic output remains relatively undistorted. 
Less clear is the nature of the underlying process(es) involved. In 
this study we examined five subjects' production of the point vowels 
/i, a, u/ in isolation and the same vowels embedded in a dynamic 
speech context under normal conditions and a combined condition in 
which (a) the mandible was fixed by means of a bite block, (b) 
proprioceptive information was reduced through bilateral anaestheti- 
ze tion of the temporomandibular joint, (c) tactile information from 
oral mucosa was reduced by extensive application of topical anaesth- 
etic, and (d) auditory information was masked by white noise. 
Analysis of foraant patterns revealed minimal distortion of the 
speech signal under the combined condition. These findings 'are 
unfavorable for^central ('e.g., predictive simulation) or peripheral 
closed-loop models, both of which require reliable peripheral infor-' 
mation; they are more in line with recent work suggesting that 

movement go als may ha fmhiftygri hy mnartlft noil ftp tivfts that h^h a vft in 

a way that is qualitatively similar to a nonlinear vibratory system. 

The remarkable generativity of human movement is. a mystery that continues 
to resist explanation. Withiii limits, people (and animals) can achieve the 
v same 'goal 1 through a variety of kinematic trajectories, with different muscle 
groups and in the face of ever- changing -postural and biomechanical require- 
ments. This phenomenqn- -variously referred to as motor equivalence (Hebb, 
1949) or equifinality (von Bertalanffy, 1973) — has been demonstrated again 
quite recently by Raibert (1978), who showed writing patterns to be charac- 
teristic of the same individual even when produced by structures (such as thfe 
foot or mouth) -that had never previously been used for the act of writing. 



*A preliminary version of this pajper wac presented at the 101st meeting of 
the Acoustical Society of* America, May 18-22, 1981. 
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Human language is generative in a qualitatively similar way: W* seem to 
have a potentially infiaite number of ways of constructing sentences. Nor is 
it trivial that language, even when stripped of its symbolic component, is a 
creative or generative activity. Articulatory maneuvers for producing speech 
sounds can be effected in spite of continuously varying initial conditions. 
Often the same phonetic segment in different environments can be achieved by 
very different movement trajectories and end-states. One commonly used 
experimental paradigm for examining equifinality in speech takes the fcrm of 
placing a bite block between the teeth, thus fixing the position of the 
mandible. Under such conditions, so-called "steady state" vowels can be 
produced apparently without the need for on-line acoustic feedback. Normal 
range formant patterns are obtained even at the first glottal pitch pulse 
(Gay, Lindblom, a Lubker, 1981; Lindblom, Lubker, a Gay, 1979; Lindblom a 
Sundberg, 1971). Moreover, -speakers are capable of such "compensatory articu- 
lation" with little (if any) articulatory experimentation. Recent work on 
bite-block speech has shown that response times * produce vowels of the same 
acoustic quality under normal and bite- block cond i.ons are nearly identical. 
In addition, the degree of "compensation" (as indexed by deviations from 
normal formant frequencies) remained unchanged as a function of practice 
(Fowler a Turvey, 1980; Lubker, 1979). The evidence, then, favors an 
interpretation that articulatory adjustments to novel contextual conditions 
are relatively immediate. 

What kind of control process could account . forjb^e^adaptive, generative 
nature of speech production".' AiT open-loop control system in which commands 
for producing a given vowel prescribe in detail "the activities of "relevant 
muscles can be dispenaed with because, by definition, such systems are 
insensitive to changing contextual conditions. On the other hand f| closed- loop 
control does offer the advantage of adjustment to initial conditions. In 
peripheral closed-loop, feedback systems, a sensory goal in the form of a 
spatial (MacNeilage, 197^0 or auditory ^target (Ladefoged, DeClerk, Lindau, a 
Papj'un, 1972; MacNeilage 1980) is paired with an appropriate set. of commands 
for accomplishing the goal, Resulting sensory consequences are then compared 
with the sensory goal 30 thtft corrections can be made. A potential problem 
with peripheral closad-locr control is that the corrective process requires 
time (at least one cycle around the corrective loop). However, if the 
adjustment to novel conditions is indeed immediate- -thus excluding the need 
for trial and error methods — then a closed- loop mechanism tied to the 
peripheral motor system fails to capture trie phenomenon of interest. 

"An alternative account favored by Lindblom and colleagues (e.g., Lindblom 
et al., 1979) replaces the peripheral feedback loop by a^ central simulation 
process that derives the expected sensory consequences from a simulated set of 
motor commands before tho actual efferent signals are sent to the periphery. 
An internal comparison between the simulated and 'target' sensory consequences 
yields an error signal on the basis of which new (and oorrect) commands can be 
emitted. In this manner, adjustments to changes in context °can be made in the 
internal, simulation without incurring erroneous effects at the periphery. 

It is important to note that the models discussed thus far make the 
explicit assumption that reliable peripheral information about the articula- 
tors 1 initial conditions is available before motor commands (simulated or 
actual) are generated. In the peripheral closed-loop model, for example* 
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sensory input roust be compared to the internal referent before the output of 
command signals. In the internal loop model, simulated motor commands are 
generated for the initial conditions that currently exist (Lindbiom et 
al. 1979). It is not clear in the latter formulation what would happen if 
contextual conditions changed between the time that simulated and actual motor 
commands were generated, A more efficient system would be continuously 
sensitive to, and be capable of modulation by, contextual conditions. For the 
sake of argument, however, let us assume with Lindbiom et al • that one benefit 
of the internal loop is its speed of correction; possibly the loop is so fast 
that appropriate output can be generated before contextual conditions have 
^hanged ♦ 

In any case, for both closed-loop models, elimination or reduction of 
peripheral information about initial conditions should drastically affect the 
system's ability to adjust to the novel situation created by a bite block. 
There are very limited data on this point. Gay and Turvey (1 979) found that a 
single subject (a phonetician) made several attempts before producing, 1 normal' 
formant frequencies for the vowel / i/ under conditions in which a bite block 
was combined with* topical anaesthesia of the oral mucosa and bilateral nerve 
blockage of t&e temporomandibular joint. Although this result has suggested 
tb some (cf. Perkell, 1979) that joint and tactile information is used to 
establish an " orosensory fra*ne of reference," we believe there are grounds for 
caution. One problem is that it is unclear how--given the considerable 
reduction of peripheral information--Gay and Turvey 1 s subject was capable of 
adaptive efdjustraent at all. One possibility, *which we consider here, is that 
auditory information may have played a potentiating role. Although acoustic 
information does not appear to be a necessary condition for compensatory 
articulation (e.g., Lindbiom et al.,1979^, the Gay-Turvey experiment does not 
preclude an auditory contribution in "recalibrating" the speech system when 
information from motor structures is rendered unreliable. 

The present experiment was designed to examine the role of peripheral 
information (auditory and somesthetic) in accounts of "immediate adjustment" 
by asking naive subjects to produce vowels unde ? r normal conditions and under 
bite-block conditions in which, somatosensory information was drastically 
reduced (if not eliminated) and audition was masked by white noise. In 
addition we address the question of whether the so-called "steady-state" 
paradigm for bite-block vowels reflects normal dynamic speech motor processes. 
By examining the production of vowels embedded in a dynamic speech context as 
well as in isolation, we can discover what differences there are, if any, in 
observed acoustic patterns. As we shall see, the availability of peripheral 
information from neither auditory nor peripheral motor structures appears to 
be crucial to immediate adjustment. We take this result as non- supportive for 
extant models of the phenomenon^ In their place, we offer a class of model-- 
emerging in other areas of motor control (Bizzi, 1980; Fel' dman, *1 966, 1980; 
Kelso, 1977; Kelso & Holt, 1980; Kelso, Holt, Kugler, & Turyey, 1980; Polite 
Bizzi, 1978) as well as in the recent speech production literature 
(cf. Fowler, 1977; 1 Fowler, Rubin, Remez, & Turvey, 1980)--that identifies 
functional groupings t>f muscles as exhibiting properties qualitatively similar 
to a nonlinear oscillatory system. The bottom line of this model and of the 
present paper is that the equi finality characteristic of vowel production may 
not be prescribed *by closed- loop servomechanisms of the peripheral or central 
kind. Rather, we argue that it may be a consequence of the parameterization 
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of a dynamical system whose design is intrinsically self-equilibrating. That 
is,, a design in which equilibrium points are a natural by-product of the 
stiffness and damping specifications for the vowel- producing system. 

Subjects . Four female volunteers w6re paid . to participate in this 
experiment. All were naive to the purpose cf the experiment. A fifth subject 
(male) who was phonetically trained and had prior experience in a similar 
experir-nt (see Gay A Turvey, 1979) was also included. 

Stimuli . The subjects' task was to say the point vowels /£, a, u/ in 
isolation and in a /p/- vowel-/p/ context. The / pVp/ syllables were spoken in 

the carrier phrase "A again." Utterances were produced in three groups of 

three tokens of a particular vowel or phrase. The subjects were instructed to 
produce all tokens of a given utterance in exactly the same fashion, with a 
clear pause after each token. They were also told not to * talk between 
experimental conditions or to practice the production task. 

Conditions . The bit$ block used was a small acrylic cylinder^ with wedges 
carved out of each end so that it could fit snugly between the. teeth. A 5 mm 
bite block waff used to restrict the normally low jaw position for production 
ofVa/ and /pap/. Either a 17 mm or a 23 mm bite block was used (depending on 
the individual subjects oral dimensions) for production of 'j f u, pip, pup/, 
whrch normally involve a high jaw position. 

All anaesthetic procedures were- performed by Dr. Robert Gross, a special- 
ist in oral and maxillofacial surgery who had collaborated with us in earlier 
work (Tuller, Harris, A Gross, 1981). Tactile information from the oral 
mucosa was reduced by sprayinjg the surface of the tongue and oral cavity with 
a 2% Xylocaine solution. The effectiveness of the topical anaesthesia « was 
tested by pricking the surfaces with a needle until the subject no longer 
reported sensation. A few catch trials were also included in an attempt to 
insure honest reporting on the pa'rt of the subject. Information from 
mechano receptors in—the jaw was reduced by injecting percutaneously a 2% 
Xylocaine solution directly into left and right temporomandibular joint 
capsules to achieve auriculotemporal nerve blockage. Chemical blockage of 
this nerve drastically impairs perception of joint position and movement 
(cf. Thilander, 1961). This condition will be referred to as the TMJ block. 

-In order to restrict the availability of auditory information, white 
noise was presented to the subject over headphones at approximately 90 dB. 
The subject was told to monitor the amplitude of her or his productions by 
watching a VU meter and to restrict the excursion of the needle to approxi- 
mately 55 dB or under. 

All subjects spoke \< th and without the bite blpck prior to the 
application of anaesthesia and under all experimental conditions. Two of the 
four naive subjects received the TMJ block before the topical anaesthesia, and 
the other two subjects underwent topical anaesthesia first. In each of these, 
pairs, one subject spoke under conditions of auditory masking and the other 
subject was allowed normal auditory information. The 1 phonetically trained 
subject received topical anaesthesia before the TMJ block and , spoke with 
masking noise in combination with these two procedures. 
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, Measurement procedure . Individual utterances were input through a Ubi- 
quitous spectrum analyzer to a Honeywell DDP-224 computer, using a 12.8 msec 
window and 40 Hz frequency resolution. The first and second formants of each 
utterance were measured from a spectral section display. As in previous 
experiments (e.g., Lindblom et al., 1979; Fowler & Turvey, 1980), acoustic 
measures of the isolated vowels were made at the first glottal pulse. For 
many English speakers the isolated vowels may not be truly static, that "»s, 
they may show some artdtoulatory movement and thus some shifting of formal. 4 ; 
frequencies; nevertheldp, the adopted procedure was to measure, fonrap 
frequencies at the firsr glottal pulse. * For the /p/- vowel- /p/* syllables, 71 
and F2 values were taken from the point within the vowel at which F2 was most 
extreme. This point was chosen as the closest approximation to the "target" 
vowel formants. 

Results . The main interest of the present experiment rests on a 
comparison of speech under normal conditions and conditions of reduced 
peripheral information. Figure 1 shows the mean values for F1 and F2 for each 
^subject. The top half shows the mean formant values for the isolated vowels, 
and the bottom half the mean foraant values for the /p/-vowel-/p/ syllables. 
Tne conditions .of speaking are coded as -follows: "M" means the subject 
produced the utterances under conditions of masking noise, "J" is the TMJ 
block, "T" corresponds to topical anaesthesia, and "BB" is the bite block. 
Each subject's nine normal productions of a given "^utterance were compared 
using t-tests with his or her productions under the most extreme condition of 
sensory deprivation. None of the subjects (except subject 1 ). showed any 
differences in formant frequency values between normal and , deprived condi- 
tions. Such was the case regardless of whether vowels were spoken in 
isolation or in a consonantal frame; t(8) values ranged from .05 to 1 .79, J>s 
>.1. For Subject 1, a significant mean difference occurred between the normal 
afld most deprived- condition only fof* F1 and F2 of the vowel /u/. *We are hard 
put to account for these anomalies: The, effect on /u/, though substantial (a 
97 Hz difference for F1 and a difference of 363 Hz for F2) is in the direction 
opposite to expectation. Specifically, the presence of a bite block might be 
expected to raise all formant frequencies when producing /u/ because of 
possible structural limitations on lip protrusion and constriction. In 
contrast, however, this subjects productions of>*/u/ with a bite block 
actually showed lower F1 and F2 frequencies than when there was no bite block. 
Neither can the effect he attributed only to masking, (which might implicate 
higher formant frequencies). Notice that the foraant values for combined 
sensory deprivation conditions are very similar with and without masking. It 
is also worth remarking that S5 in Figure 1 is the phonetically trained 
subject whose , results confora to the general pattern shown by naive subjects. 

Before concluding that these results reflect immediate adjustment in the 
'deprived condition, it is necessary to exclude the possibility that systematic 
.changes in formant values occurred over trials. Figure 2 shows the P1 and F2 
values for individual tokens, in order, for the vowel / i/ prqtfuced by one 
subject under the most extreme conditions '(i.e. topical anaesthesia, TMJ 
block, a 23 mm bite block and masking noise). Also .shown are the -mean F1 arid 
F2 values for thi , combined * condition, and the-,mean value of the subjects 1 
"normal" formants. * The sUpe of the line foraed by tokens one chrough nine 
-does not differ significantly from the line formed in the (non-bite block) 
control condition. Evidently, there does not appear to be a systematic 
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Figure 1. • Mean values of F 1 £' X -axis) and F2 (y-axis) for nine repetitions of 
the indicated utterances. Data for the five subjects are presented 
separately. Top: Isolated vowels: Bottom: ' /pVp/ syllables. 
M = masking noise, J = TMJ block, T = topical anaesthesia, 
BB =-bite block. ' 
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Figure 2. F, and F2 values for individual tokens, in order, of the vowel /i/ 
produced by one subject under the most extreme experimental condi- 
tion. 
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learning effect occurring over trials. We confirmed this statistically for 
all subjects by performing linear regression analyses across trials of each 
subject's productions under normal and deprived conditions* Correlations were 
converted to z-scores and t- tests performed to determine whether the slopes 
differed between the two conditions* Of the sixty analyses performed (5 
. suojects by 6 utterance types by 2 formants) , not a single one showed a 
difference in slope; t(7) values ranged from .00 to .84, j>s >.1. 

DISCUSSION 

The present data are not easily explained by current models of movement 
control that have been proposed to account for the remarkable context 
sensitivity of the speech production system. Closed-loop models-*-of trie 
central or peripheral kind--both entail an availability of reliable sensory 
information about the initial conditions of the articulators. However, our 
experiment shows that acoustically normal vowels oan be produced not only when 
the normal relationships among the articulators are changed by a bite block, 
but also when sensory information from auditory, joint, and tactile sources is 
drastically reduced as well. Furthermore, * and as other recent work also 
suggests (cf. Fowler and Turvey, 1980), "articulatory compensation" appears to 
be achieved immediately and with little or no practice; none of our naive 
subjects 1 data provided any evidence of short-term adaptation* In support of 
the latter claim, the data displayed in Figure 2 are actually from the same 
subject that apppeared to display motor learning effects In an earlier study 
(Gay & Turvey, 1 979) • 1 

Before offering an alternative interpretation of our data in terms other 
than closed- loop models, two caveats may be in order. The first is that our 
results do not necessarily refute closed- loop control when the system is in 
its normal mode, that is, when all sources of information are available. The 
second is that our paradigm in all likelihood does not completely eliminate 
peripheral information, and hence a closed- loop simulation model cannot be 
ruled out completely. Nevertheless, given the drastic reduction in 
propriospecific information we (and surely the proponents of closed- loop^ 
models) might have expected much more severe distortion of the acoustics than 
was observed here. 

In spite of these caveats, we believe that a moj^ parsimonious account of 
the 'phenomenon can be forwarded, though it is less well known in speech 
research than the servoengineering model. The account that we shall Consider 
does not, in fact, depend on whether 3ensory input about the initial positions 
of articulators is available or" not. Thus there is no requirement for one 
model when sensory afference is available and another quite different model 
when it is absent. y 

The view that we shall express for the present aata has been laid out in 
some detail elsewhere (Fowler et al., 1980; -'Kelso, Holt, Kugler, & Turvey, 
1980)* In brief, it argues that functional groupings of muscles—sometimes 
called synergies (cf. Gelfand, Gurfinkel, Tsetlin, & Shik, 1971) or 
coordinative structures (cf. Turvey, Shaw, & Mace, 1 978)--exhibit behavior 
qualitatively similar to a (nonlinear) mass-spring system* Such systems arc 
intrinsically sel f-equil ib rating in the sense that the "end- point" of thf 
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system, or its "target," is achieved regardless of initial conditions. Thus 
in normal and deafferented animals (Bizzi, Dev, Morasso, & Polit, 1978), it 
cen be shown that desired limb positions are attainable without starting 
position information, and even when the limb is perturbed on its path to the 
target. Similarly, the localization ability of functionally deafferented 
humans (Kelso,: 1977; Kelso & Holt, 1980) and individuals with the 
metacarpophalangeal joint capsule surgically removed (Kelso, Holt, & Flatt, 
1980) is not affected by altered initial conditions or unexpected 
perturbations. These data have led to the vJLew that the "target" of the 
system is not achieved by means of conventional closed- loop control; rather it 
is a consequence of the system's dynamic parameters (ma^s, stiffness, 
damping). In such a model, the only parameters that need be specified for 
voluntary movement are stiffness and resting length: Kinematic variations in 
displacement, velocity, and acceleration are consequences of the parameters 
specified, rather than controlled variables, and sensory " feedback"— at least 
in the conventional computational sense — is not required (cf. Fitch & Turvey, 
1978; Kelso, Holt, & Flatt, 1980; Kelso, Holt, Kugler, & Turvey, 1980). 

It is worth noting that the view expressed above is equally applicable to 
disruptions 4 that are static and anticipated (as in the present bite block 
experiment), and th03e that are time varying and unanticipated. For example, 
recent studies of the latter kind have' shown that "compensatory responses" of 
short latency are observed in perturbed articulators as well as in others that 
contribute to the same "vocal tract goal" (cf. Abbs, 1979, for review). 
Current theorizing, however f offers two distinct mechanisms to explain the 
system's reaction to perturbation: A predictive simulation mechanism for 
anticipated disruptions (Lindblom et al . , 1979) and a closed-loop peripheral 
feedback mechanism for unanticipated disruptions (Abbs, 1979). 

The analysis offered here views such a distinction ts redundant. 
Immediate adjustment to either type of perturbation is a predictable outcome 
of a d.ynamical system in- which muscles function cooperatively as a single 
unit. If the operation of certain variables is fixed, as in the bite block, 
or unexpectedly altered, as in online perturbation, linked variables will 
automatically assume values appropriate to the constraint relation (as long as 
biomechanical limitations are not violated). In short, dynamical systems (of 
which speech is a member) always operate in a mode that one can describe as 
"compensatory." 

Although we cannot offer a detailed description of the muscles of the 
vocal tract in terms of the style of control outlinad above, we believe there 
are some grounds for optimism. Fujimura and Kakita (1979), for example, have 
performed a three-dimensional simulation of the tongue that uses quantitative 
control of contractile forces of the muscles actually involved. By treating 
the tongue muscles (in this case the posterior and anterior portions of 
genioglossus) as a cooperative unit and maintaining the relative magnitude of 
contractile inputs to each muscle, it can be shown that the acoustic pattern 
for the vowel /i/ is obtainable with a wide variety of absolute force values. 
Thus, as long as the contractile balance among linked muscles is preserved, 
the exact magnitude of muscle contraction (beyond a critical value) does not 
matter (see also Kakita & Fujimura, 1977). The generality of this model is 
limited, at this time, to a single point vowel. Nevertheless, the nonlinear 
relationship between muscle forces and acoustic pattern allows, or rather 
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provides for, a context- conditioned production system. As in recent accounts 
of limb localization, invariant "targets" can be attained with different 
stiffness specifications, as long as the balance in stiffness among relevant 
muscles is preserved. 

As a final point , the analysis offered here suggests a commonality in 
function between the system capable of producing vowels and that involved in 
the attainment and maintenance of limb postures. Both systems are materially 
distinct from each other but share behaviors qualitatively like a nonlinear 
masa spring.' The nontrivial claim, then, is that speech and limb movements 
are dynamically alike in sharing a common solution to the equifinality 
problem. 
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FOOTNOTE 



1 1ndeed it was after observing the bite block performance of our naive/ 
subjects under reduced information conditions' that this person offered to 
participate in the present experiment. This was a magnanimous gesture for 
which we express our gratitude. 
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THE STREAM OF SPEECH* 

Robert E. Remez+ and Philip E. Rubin 



Abstract . The use of sinusoidal replicas of, speech signals reveals 
that listeners can perceive speech solely from temporally coherent 
spectral variation of nonspeech acoustic elements. This sensitivity 
to coherent change in acoustic stimulation is analogous to the 
sensitivity to change in configurations of visual stimuli, as 
detailed by Johansson. The similarities and potential differences 
between these two kinds of perceptual functions are describes 

Studies have shown that the continuously changing stream of speech can be 
approximated by a kind of acoustic animation, at the theoretical heart of 
which is an idealization of the human vocal tract as a resonar.t horn (e.g., 
Chiba & Kajiyama, 19M1; Fant, 1956, I960; Stevens & House, 1955). The 'details 
of the acoustics of speech can thereby be explained by noting that the vocal 
horn cam- be constricted at different places along its length, that it may be 
multiply excited, and, especially, that its shape can.be changed rapidly. In 
practical situations, such as speech synthesis, the assumption of the hornlike 
properties is tacit, presupposed; the synthesizer speaks by the excitation of 
a lumped-circuit resonator (or its digital equivalent), which is itself 
approximate to horns of many types, including vocal tracts. 

Although the term "speech stream" is often used to refer to the acoustic 
products of human vocalization, speech has commonly been studied by conceiving 
this metaphor ic stream as an imbrication of more or less isclable elements, 
such as steady-state or transitional formant patterns, plosive bursts, band- 
limited noise, and stretches of silence. In our perceptual accounts, then, 
the exclusive attention to perceptual effects of specific elements in the 
acoustic pattern has led us to undervalue the coherence of the speech stream. 
In contrast to theoretical characterizations of the speech stream emphasizing 
structural continuicy (Liberman, 1970), experimenters find it quite' agreeable 
in practice to treat the perceptually relevant acoustic structure as if it 
consisted of distinct elements. Within this framework, properties of change 
in the speech signal figure primarily as a problem; from a dynamic array, the 
listener must somehow extract static elements or cues, perhaps even by means, 
of a specialized decoding device. However, research continually reveals that 
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Figure 1. A) The English sentence, "My dcg Bingo ran around the wall. 11 The signal was produced 
by a natural talker and was quantified by a spectrum analyzer employing a bank of 
acoustic filters. B) The same sentence is represented here as a sequence of center- 
frequency values of the first three formants. The method of Linear Prediction was 
used to prepare this series of spectral sections at 10 msec intervals. C) The same, 
sentence is realized here .in a signal composed of three time-varying sinusoids. The 
pattern of relatively continuojs change of the natural signal is retained, while the 
broadband formants and harmonic structure of the natural speech signal are eliminated. 
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perceivers care very little" about the momentary attributes of speech signals. 
Even under direct test, and in highly favorable conditions, listeners seem 
unable to report acoustic properties of the stimulation on which their 
phonetic percepts are reliably based (Best, Morrongiello, & Robson, 1981; 
Pisoni, 1971; Mattingly, Liberman, Syrdal, 4 Halwes, 1971). 

To describe the speech stimulus in a manner appropriate for perception, 
we must therefore characterize the time-varying spectrum of the acoustic 
pattern. In doing so, we elaborate the acoustic coherence of the speech 
stream, and avoid reducing it to a sequence of static acoustic moments 
irrelevant to the operating principles of the perceptual system. Only in this 
fashion may we gain a clue about the "smart" perceptual processes (Runeson, 
1977) applied" to speech stimuli. In addition, the success of the discrete-cue 
approximations of speech would then be explained by observing that a sequence 
of cues, however it reconstitutes properties of coherent change in the speech 
stream, is perceptible because it conserves the important properties .of 
acoustic variation , rather than because it conserves appropriate short-time 
spectra or discrete acoustic elements. 

RECENT EVIDENCE 



Although similar notions have be*>n expressed from time to time in speech < 
research (e.g., Liberman, 1970), several recent experiments especially promote 
these speculations about the importance of spectrum variation in speech 
perception (P*nez, Rubin, & Carrell, 1981; Remez, Rubin, Pisoni," & Carrell,. 
1981) In these studies, listeners perceived phonetic segments from acoustic 
stimuli consisting solely of two or three sinusoids. Frequency and amplitude 
variations of the sinusoids imitated the changes of „he vocal resonances found 
in natural speech signals (see Figure 1). Specifically, each sinusoidal tone 
was matched in amplitude and frequency to one of the formants,.or resonances, 
of a natural utterance that served as a model. Matching values were 
determined 1 for successive 10 msec sections of the natural utterances, and 
these values were used to control a sinusoidal synthesizer. 

None of the acoustic cues typically believed to underlie phonetic 
perception was present in the sinusoidal patterns,: neither formant transi- 
tions -nor steady state formants, because there were no broadband vocal 
resonances 'in this three-tone signal; nor fundamental frequency, because the 
three-tones were not harmonically related; nor band-limited noise, because the 
signal had only three periodically unrelated components. Thus, the short-time 
spectra of the signals did not satisfy the amplitude and frequency require- 
ments of the spectral templates that are sometimes claimed to be useful in 
analyzing the acoustic pattern into phonetic units (Stevens & Blumstein, 
1981) Despite this absence of vocally producible constituents, the overall 
pattern of frequency and amplitude variation imitated natural acoustic pat- 
terns. Listeners who- found these sinewave replicas of speech to be intelligi- 
ble evidently disregarded the inappropriate momentary acoustic structure and 
were untroubled by the lack of traditional acoustic cues. Rather, they must 
have attended to the coherence of the time-variation of the tones which 
betrays the vocal origin of the signal, and, at the same time, specifies an 
impossible sounding voice. 
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Unlike vocal resonances that share the same laryngeal source, each 
sinusoid is acoustically independent, and listeners readily reported this 
distinctness. Accordingly, it was rare, that n&ive listeners spontaneously 
attended to phonetic information In this grossly unnatural phonetic carrier. 
Listeners, told nothing in advance about the three-tone signals, heard them 
simply as three simultaneous tones, modulated asynchronously as if in three- 
part counterpoint. However,, the simple instruction to listen for a sentence 
enabled almost 70%* of naive listeners to detect £ sizable chunk of the 
message, if not its entirety. In other words, listeners made use* of phoneti'c 
information that was exclusively time-varying in nature, in the absence of 
short-time spectra characteristic of vocalization. 

AN ANALOGY WITH VISUAL EVENT PERCEPTION 

The analogy is readily apparent between our experiments on phonetic 
perception from sinusoids, and Johansson's (e.g., 1975) experiments on the 
perception of locomo- ory >and other movements from point-light displays. In 
both cases, it appea-<* »*hat the pattern of. coherent change in the stimulation 
conveys information &> but the c/ent in progress. In Johansson's case, visual 
displays are made by videotaping a human figure moving in the dark, illuminat- 
ed only at the joints of the articulating limbs. Although it is impossible to 
identify the content of the dot display from the single snapshots, th£ moving 
dots pY light convey a wide range of subtle locomotory information. It is 
this organized change in the constellation of lights that carries information 
about the walking actor, despite the absence of static information to reveal 
which light belongs to which joint. Our sinewave element is like a 'speech 
formant in the same way that Johansson's light spot is like a « radiocarpal 
joint — the value of each element is established only by virtue of the coherent 
configuration to which it belongs . 

The analogy is not perfect, though. The distal object for Johansson's 
subject was a walker who seemed to mean nothing by his walking. In contrast, 
the distal dbject for our subjects was a message spoken by a, strange talker. 
Our subjects perceived a highly structured message, and Johansson's did not. 
But, this may merely be a superficial methodological discrepancy if the visual 
observer can perceive whether the person in the display is performing a tango 
or a fox trot; or' whether the person is using body "language" or American Sign 
Language, and what the message is (Poizner, Bellugi, & Lutes-Dr iscoll , 1981). 
In each case, then, the perceiver identifies a person (one talking, one 
dancing), a structured transformation (one linguistic, one terpsichoric) , and 
a strange medium (one sinuisoidal, one dotty). 

There is an additional discrepancy between Johansson's paradigm and ours/ 
Subjects seem to find the information in the moving dot displays to be more 
accessible than the information in the sinusoidal displays. We do not 
understand this very v/ell, but the fact that so many naive subjects hear 
sinusoids phonetically when instructed to do so may reduce the significance of 
this difference between the visual and linguistic 'cases . • 
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A POTENTIAL FORMAL DIFFERENCE 



t 

In view of the^ similarity, it seems appropriate to distinguish formally 
some properties of visual m'otion perception and speechl perception. Faced with 
the task of describing the geometry of optical flow, Johansson writes, "the 
self-motion component in the proximal flow (is combined with* tremendously 
'complex object motion flow* The result is from a mathematical point of view^ 
like a chaos or is at least mathematically complex to absurdity 11 * (Note 1, page 
6), Of course, the perceiver often, disentangles the x various components easily 
despite the limitations we otherwise experience 4 as t descriptive geometers* 
Acoustic change in the case of speech may not prove quite so elusive to 
describe, though. In principle, the physical limits of the variation of 
speech sounds are far narrower^-conatrained anatomically and linguistically — 
than are the physical limits of optical flow, which appear to be set, after 
all, by mechanics. And, we have a tremendous head, start of forty years of 
"ecological acoustics" of the vocal tract. In any event, we suggest that 
studying the coherence of the - speech stimulus — describing the stream of 
speech — requires^ a change of. emphasis that brings research on phonetic 
perception closer to the approach established by Johansson for studying the 
visual perception of events. _ v _. 
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USING' THE ACOUSTIC SIGNAL TO MAKE INFERENCES ABOUT PLACE AND DURATION OF 
TONGUE-PALATE CONTACT k 

pT J. Price* * 



Abstract . Productions of /t/ and /d/ in various intervocalic 
contexts by four native speakers of American English were examined. 
Closure duration was 1 measured from the acoustic signal and 'from 
dynamic palatography data* Place of contact was observed from the 
palatographic signal, and formant frequency measurements in the 
closure interval were made. Measurements were averaged over six 
productions of each stimulus by each talker. Duration measurements 
by the two techniques correlated well, except (as expected) in 
instances in which' the second vowel had a glottal onset/ in w»iich 
c case the duration measurements in the acoustic domain were lpnger 
than the durations measured palatographically. . Place of contact 
correlated well with F3 measurements.. Normalization across talkers 
was achieved by dividing each F3 measurement by the talker's mean 
F2. 



INTRODUCTION 

Dynamic palatography provides an excellent means to observe tong-3-palate 
contact. This technique is "more direct than computing aren functions or 
relying on articulatory introspection. On the other hand, dynamic palatogra- 
phy is less widely available than area function algorithms or introspective 
data. Dynamic palatography is, thus, of interest both in making direct 
measurement and in providing a reliability measure for analyses that can be 
carried out with a wider range of subjects and conditions. The latter is the 
subject of the present report. 

Both closure duration (duration of tongue-palate contact here) and place 
of constriction are important in the classification of speech sounds. Closure 
durations can distinguish flap from stop articulations (see, e.g.,, Port, 1976; 
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Zue & Laferriere, 1979), voiced stops from voiceless stops (see Lisker, 1957), 
and fricatives from affricates (see Dorman, Raphael, & Liberman, 1979). Place 
of articulation is built more directly, into our classification system: 
Consonants are classified as labial, palatal, or velar depending on where the 
tqngue * makes contact with the palate. The artificial palates used oermit 
analysis of articulations from just behind the front- teeth to just before the 
soft palate.' Thus, the method is best s r uited for, articulations iti the 
alveolar to the palatal region. Place resolution is to within about 4.5 mm 5 
time resolution is to within 16 msec. 



METHOD 

Subjects 

* 

The study employed four talkers (TB, FB, LL,.LR), all native speakers of 
American English. FB is female; the others are '*ale. FB' and~TB are practiced 
talkers with the artificial palate; LL and LR u~ d their artificial -palates 
for the first time in this experiment. 

> Stimulus Materials N ' 

The stimulus times (shown i.rr Figure 1) were read from a randomized list. 
This list includes items in which [t], [d], and flap [£] occur in intervocalic 
position. Other items appeared in the list but will not be discussed^ here. 
The items are grouped by stress environment and by likelihood 1 * occurrence of 
flap rather than stop articulation^. Flaps can be defined articulatorily by 
the quick gesture of the tongue £ip in the direction of the alveolar ridge 
associated with them. In American English, flaps typically occur ii) "latter" 
and "ladder," but not in "adorn"^ or "atone." In some contexts, flaps may 
alternate with stops. These items', therefore, occur t,wice in the list; they 
ar^e, distinguished by parentheses. Each talker produced six examples of each 
item". 

M easurement Procedures 

— — t 

A 

The artificial palate used in dynamic palatography is about 1.5 nun thick 
and fits over the hard palate of the talker. It is embedded with 63 gold 
electrodes configured as in Figures 2a and 2b. Sampling occurs every 15.6 
msec. These data can be stored and played back for a frame-by-frame analysis 
by means of the RION Model DP-01 electropalatpgraph. The electrodes are about 
M to 5 mm apart, depending on the size of the talker's palate (4.5 nun is used 
as an estimate here). 

Measurements related to closure duration and place of contact were made 
both acoustically and paiatographically . For the purposes of this study, 
duration and place of closure were defined as follows. Complete closure was 
defined oy contact with at least one electrode in 10 of the 11 columns labeled 
in Figure 2a. The definition of completeness of closure was flexible since 
the 15.6 msec sampling interval of the palatograph was somewhat long relative 
to flap durations (see, e.g., Fisher & Hirsh, 1976; Lisker & Price, 1979; 
Port, 1 9 76 ; Zue & Laferriere, 1979), and since the electrodes ai 3 tuned a^a 
group ra,her than individually. Closure duration as measured from the palate, 
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WORD LIST 



TEST ITEMS 

FLAPPING ENVIROMENTS 

post- stress heating, heeding 

heater , heeder, heat 'er. heed 'er 

hotter , solder , water 

(Toto), ( dodo ) 

butter 

(todo) 

addict (noun) 
potty , toddy 

pre -stress at A, ad A 
at lye 

NON- FLAPPING ENVIROMENTS 

post -stress tutu ( todo ) 

(Toto), ( dodo ) 

pre -stress atone , a tone, adorn 
to do, todo 
g dive 

addict (verb) 



a day 



OTHER ITEMS 



halter, all day , saunter , center , sender, 
party, tardy , hearing , healing , hearer , heole r , 
horror , holler , sorrow , solo , guru, zulu, 
hurry, holly, array, allay, a roan , alone 



Figure 1 . Word list. 
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"COLUMNS" OF THE PALATOGRAPH 




ROW I 
ROW 2 



o o o 



°o° 0 o°o Q o° 0 0 

0 o0 0o ° o0 o 0 8 
o o 



"ROWS" OF THE PALATOGRAPH 



Figure 2. Configuration of the electropalatograph. a. Columns, b. Rows. 
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then, was the number of frames of complete closure (as defined above) 
multiplied by the 15.6 msec frame duration. Closure duration was measured in 
the acoustic domain by demarcating visually the amplitude dip in the waveform 
and F1 excitation in the spectral analysis. Place of contact was measured 
from the palate by taking the mean of the front-most and rear-most rows 
touched, where "rows" are defined as in Figure 2b. In order to estimate place 
of contact from the acoustic analysis, frequency measurements of the second, 
third, and fourth formants were made from spectral cross-sections taken at the 
end of the acoustically defined closure interval neighboring the stressed 
vowel . 



RESULTS 

Duration of Contact 



Figure 3 plots the duration measurements made from the palatograph as a 
function of those made from the acoustic analysis. For items that were 
pronounced with flaps in all six productions by a given talker or with stops 
in all six productions, each point represents an average over those six 
productions by each talker. For items in which flaps alternated with stops in 
a given lexical item, two points occur, one for the flap articulations and one 
for the stops. In these cases, the points represent averages over fewer than 
six productions. The dashed straight line indicates where all points would 
fall if the two measurements were in perfect agreement. 

The triangles in Figure 3 represent the set of tokens judged auditorily 
to have been produced with a glottal attack on the second vowel. All of these 
tokens occurred in the V-V environment. It is reasonable that acoustic 
analyses should yield longer closure duration measurements than the palato- 
grapKc measurements since the low amplitude of the waveform and the lack of 
strong F1 may result either from closure of the upper vocal tract or from 
glottai control. The perceptual salience of the glottal attack indicates that 
the glottis is involved. This figure indicates that some of the flaps may be 
"glottalized"; note the group of circles with waveform durations of 20 to 40 
msec and palate durations of 0 to 20 msec. Very short durations are 
problematic since the palatograph samples only every 15-6 msec. The averaging 
over six tokens should, however, moderate this limitation, and, further, some 
of the differences in the two measurements are lafger than the expected 
maximum difference of about 15 msec. What is more, the amplitude dip in these 
tokens is somewhat greater than would be expected if upper vocal tract closure 
were the only mechanism responsible. Thus, it is likely that at least some 
flaps produced by these talkers involve some glottal control. 

There is some indication that the artificial palate may interfere 
somewhat with articulation (see Hamlet & Stone, 1978). Thus, some caution 
must be used in interpreting data collected palatographically : Items that 
sound unnatural may be made with articulations representing more usual habits, 
while those that sound natural may be made with articulations that reflect 
adaptive strategies. That the palates used are only 1.5 mm thick minimized 
but did not eliminate this problem. However, the primary purpose of the 
present experiment is to map out some acoustic correlates of certain articula- 
tions, and for these purposes the limitation is not so crucial as it would be 
if naturalness were required. 65 
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Figure 3. Correlation of duration measurements from waveform with those from 
palatograph. Each circle is an average over six items produced by 
a given talker, except in cases where flaps alternate with stops. 
The dashed line indicates where all points would fall if the two 
measurements were in perfect agreement. Triangles represent the 
glottalized items. 
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In sum, though one must be aware of the limitations of the method, it is 
possible to make fairly good closure duration estimates from acoustic ana- 
lyses. This method accounts for 59% of the variance. Further, if the tokens 
that are perceived as strongly glottalized are removed, 88% of the variance is 
accounted for. 

Place of Contact 

The characterization of retroflex sounds by lowering of the third and 
fourth formants has been noted (Fant, 1973; Ladefoged, 1975). That American 
English flap has not typically oeen described as retroflexed may be due more 
to its phonological role (its alternates with It] and td]) than to its 
perceptual or acoustic qualities. Similarities of American English flap with 
sounds that are described as ratroflex have been pointed out (see Monnot & 
Freeman, 1972, for a comparison with Spanish single-tap /r/; see Mori, 1929, 
for a comparison with Japanese flap /r/; see Price, 1981, for a more general 
discussion). Since the flap productions collected for the present study were 
acoustically similar to the descriptions in the above-mentioned Ladefoged and 
Fant references, and since there was corroboration of at least a more 
posterior place of contact than for tt] or td], it seemed reasonable to 
measure F3 and F4 as a potential measure of degree of retroflexion. 

The glottalized tokens were omitted in this analysis, since the place of 
contact and the point at which F3 and F4 measurements were taken do not line 
up temporally. The stop articulations (tt] or td]) were also omitted, since 
the first electrode of the palate was not front enough to measure place of 
contact accurately: recall that the technique involved the average of front- 
most and rear-most contact, and stops are produced with a wide area of contact 
that spreads to the teeth where no electrodes are located. The analysis, 
thus, concerns the set of all flap articulations. 

Figure 4 shows the palatographs place measurements plotted as a function 
of the third and fourth formant frequency measurements for the practiced 
talkers (FB and TB) . Both F3 and F4 for both talkers correlate rather well 
with place of articulation as measured from the palatograph: 72-74% of the 
variance is ac.^unted for in each case. F3 was selected for future measure- 
ments, however, since it is easier to track and measure than F4. 

These data show that reasonable place estimates can be made for a given 
talker based on formant frequency measurements. Is it possible, though, to 
make comparisons across talkers? Absolute frqeuency will not do, for example, 
since an F3 measurement in the 2000 to 2500 Hz range represents the front-most 
articulation for talker TB; the rear-most for talker FB. Multiplying these 
frequency measurements by any constant that represents a given ' talker ' s 
formant frequency range might make normalization possible. Mean F2 was used 
here and appears to have worked fairly well. 

Figure 5 shows how well this normalization worked for all flap produc- 
tions by all four talkers. Note that the female talker 1 s productions are now 
within the range of those of the three male talkers. The cluster of points 
above 1.!) along the x-axis are flaps in the environment of front vowel /i/. 
Again, the limitation of the placement of the front-most electrode probably 
makes these articulations appear further back than they are. The correlation 
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coefficients are somewhat higher for the individuals than for the entire 
group. Yet the group coefficient of -.78 (accounting for 61* of the variance) 
shows that fairly accurate place estimates can be made from acoustic analysis. 



" CONCLUSIONS 

Subject to the limitations already mentioned, measurements in the acous- 
tic domain can be used to infer both place and duration i of tongue-palate 
contact. 
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PATTERNS OF HUMAN INTERLIMB COORDINATION EMERGE FROM THE PROPERTIES 
OF NON -LINEAR LIMIT CYCLE OSCILLATORY PROCESSES: THEORY AND DATA* 

V 

J. A. Scott Kelso* , Kenneth G. Holt, Philip Rubin, and Peter N. Kugler 



A bstract . The present article represents an initial attempt to 
offer a principled solution to a fundamental problem of movement 
identified by Bernstein (1967), namely, how the degrees of freedom 
of the motor system are regulated. Conventional views of movement 
control focus on motor programs or closed-loop devices and have 
little or nothing to say on this matter. As an appropriate 
conceptual framework we offer the physical theory of homeokinetics 
of Iberall and, his colleagues elaborated for matters of movement by 
Kugler, Kelso, and Turkey (1980). Homeokinetic theory characterizes 
biological systems as ensembles of non-linear oscillatory processes, 
coupled and mutually entrained at all levels of organization. 
Patterns of interlimb coordination may be predicted from the proper- 
ties of non-linear limit cycle oscillators. In a s?t of experiments 
and formal demonstrations we show that cyclical, two-handed move- 
ments maintain fixed amplitude and frequency (a stable limit cycle 
organization) under the following conditions: (a) when brief and 
constantly applied load perturbations are imposed on one hand or the 
other, (b) regardless of the presence or absence of fixed mechanical 
constraints, and (c) in the face of a range of external driving 
frequencies from a visual source. In addition, we observe a tight 
phasic relationship between the hands before and after perturbations 
(quantified by cross-correlation techniques), a tendency of one limb 
to entrain the other (mutual entrainment) and that limbs cycling at 
different frequencies reveal non-arbitrary, sub-harmonic relation- 
ships (small integer, subharmonic entrainment). In short, all the 
above patterns of interlimb coordination fall out of a non-linear 
oscillatory design. Discussion focuses on the compatibility of 
these results with past and present neurobiological work, and the 
theoretical insights into problems of movement offered by homeoki- 
netic physics. Among these are, we think, the beginnings of a 
principled solution to the degrees of freedom problem, and the 
tentative claim that coordination and control are emergent conse- 
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quences of dynamical interactions among non-linear, limit cycle 
oscillatory processes, 

1., INTRODUCTION 

The beginning of the 1970s brought about a remarkable change in the 
approach of psychology and related disciplines to the area of movement 
behavior. This sQrcalled product to process shift that mqny have remarked on 
(e.g., Pew, 1974; Schmidt,' 1975) eVolved from emerging models of human 
information processing ' and empirical attempts to discover the nature of- a 
myriad of hypothetical processes — stimulus detection, memory retrieval ,° re- 
sponse selection (to name but a few) — thought to be involved in behavioral 
action. Even more significant was the embracing by<. psychologists of control- 
theoretic and cybernetic concepts; a -move that led to models of motor skill 
learning (Adams, 1971; Schmidt, 1975) and memory (cf. Laabs, 1973; Stelmach, 
1 974 ) and a great deal of laboratory activity (for updates and developments of 
closed-loop theory, see Adams, 19,77; for updates on schema theory, see Shapiro 
& Schmidt, 1982). 

At the beginning of the 1980s it seems timely tb remark that our theories 
and models (like many of the theories and models in biology and the social 
sciences) are flawed by a deep-seated anthropomorphism that extends back to 
the time of Descartes: acting humans are compared to machines (computers and 
servomechanisms) provided with means of control and self-regulation. Motor 
control theories are peppered with anthropomorphic concepts — perceptual 
traces, reference mechanisms, comparators, schemas, programs, and so forth — 
created to "explain" data. Although these concepts have been, and probably 
will remain, useful for developing an intuitive idea of the way motor systems 
work, we believe it is now time to consider a dynamical account of movement 
behavior — one that is consonant with the newly emerging physics of living 
systems (cf. Kelso, 1981; Kelso, Holt, Kugler, & Turvey, 1980; Kugler, Kelso, 
& Turvey, 1980, 198^; Turvey, 1980) — even if it is at the' expertise of some 
abstraction. The theoretical approach offered here is in its infancy, but the 
need for it and the types of experiments motivated by it (some of which we 
report here) can be clarified when we consider further some of the shortcom- 
ings of existing theory. 

2. SOME LIMITATIONS OF CURRENT THEORY 1 ■ 

The problem — aptly understated — that has not received as much theoretical 
attention in the movement domain as it warrants, is one shared by all living 
systems; namely, how the internal degrees of , freedom of the system are 
regulated (cf. Bernstein, 1967; Iberall & McCulloch, 1969). Recently we and 
others have addressed this problem in some detail (e.g., Fowler, Rubin, Remez, 
& Turvey, 1980; Greene/ 1972, 1978; Kelso, 1981; Kelso & Holt, 1980; Kelso et 
al., 1980; Kugler et al., 1980, 1982; Turvey, 1977, 1980; Turvey, Shaw, & 
Mace, 1978). One suspects that a main deterrent to a biologically motivated 
solution to the degrees of freedom problem lies in the view — shared by many — 
that humans, like computers, are simply information processing devices 
(cf. Berlinski, 1976). Theoretically, so the argument goes, computers can 
perform any calculable task that humans perform. Thus it should be possible 
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to retrace the putative sequential steps that a human takes in solving a 
problem and instantiate them in program form. The physical realization of 
'such a program would exhibit behavior that was artificially intelligent in the 
sense that such behavior would be indistinguishable from human performance. 
But, as Berlin ski ( 1976) emphasizes, such a claim rests on a fundamental 
error. Just because computers can simulate certain types of human performance 
is hardly evidence that humans actually do employ such programs. Searle 
( i 98O ) takes a similar stance in noting that the feature that seemed so 
attractive to people in artificial intelligence, namely the distinction 
between the program end its realization, proves fatal to the claim that 
simulation qualifies as duplication . Thjs in Searle 1 s (1980) terms: 

The same program could be realized by an electronic machine, 
Cartesian mental substance or a Hegelian world spirit... If mental 
operations consist of computational operations on formal symbols, 
then it follows that they have no interesting connection with the 
brain [for] the brain just happens to be one of the indefinitely 
many types of machine capable of instantiating the program. This 
form of dualism... is Cartesian in the sense that it insists that 
what is specifically mental about the mjlnd has no intrinsic connec- 
tion with the actual properties of the brain (pp. 423-424). 

A second and related point is that both programming and cybernetical 
solutions to the degrees of freedom problem vastly undermine^ U)e dynamics of 
the structure to be controlled. They fail, as Yates, Marsh, and Iberall 
(1972) remark, the "test of matching" : in order to couple a control device to 
the system being controlled there, must be some match between scales of energy 
or mass for efficient operation to ensue. In short, computational or 
algorithmic solutions place their emphasis on the small signal, information 
aspects of the system but undervalue the equally important energy-converting 
machinery (the power fluxes, Yates et al., 1972). A viable account of the 
dissipation of degrees of freedom' for motor systems should recognize the 
mutuality between informational and power sources. 

Much of current theorizing on motor ' behavior perpetuates a conceptual 
chasm between the brain as the source of signals for coordination and control 
and the high power, energy-converting muscular system that is the putative 
recipient of such messages. For example, in neurophysiological and behavioral 
studies of movement, it is common for investigators to assume that the 
products of motor function are isomorphic with the underlying (brain) 
processes from which those products derived. If the movement of an animal 
terminates at some spatial location, within a set period of time, for example, 
that spatial location and duration are said to be contained or represented in 
the animal's motor program. The motor program then is viewed as causally 
responsible for generating the spatial position of a limb and metering out the 
ime it A takes for the limb to get there. 

However, we suspect in real biological systems— unlike formal systems- 
there ma/ be no need to represent explicitly every detail in the behavioral 
sequence. Rather, sequential organization may be due primarily to dynamical 
laws and the existence of constraints thac serve to guide those dynamics 
(cf Pattee 1977). If this view is correct, then the order and regularity in 
movement behavior that we observe will not be dje to an a priori prescription— 
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in terras of programs or reference levels— that is independent of and causally 
antecedent to the motor activity in question. Nor will it be an isomorphic 
representation of the behavior to be- explained. Rather, spatiotemporal 
organization — the dissipation of degrees of freedom — will arise as an a 
posteriori fact, an emergent property that is a consequence of, and concomi- 
tant with the dynamical behavior of the system (cf. Fowler, 1977). 



3- THE DYNAMIC ALTERNATIVE 

The answer to what Bellman (1961) called "the curse of dimensionality"-- 
the problem of understanding the relationship between informational and power 
processes — is offered in a recourse to dynamics, defined as the physics of 
motion and ^.ange. If, ^as we assume, living systems obey the laws of physics 
(though they are not readily reducible to them) then, given that formal 
machine concepts may provide an inadequate basis for complex behavior, what 
can a dynamical explanation offer in their place? To provide a reasonable 
answer ,to this question we have to be armed with certain physical concepts 
that apply to active living systems. 

In the past, a physical description of biological processes has been 
deemed inappropriate because dynamics has dealt almost exclusively with the 
behavior of closed, entropic systems (i.e., systems tending towards randomness 
and disorder) . Thermodynamic law states that in a closed system the tendency 
toward entropy will increase to a maximal value, and that the process is 
irreversible. In contrast to closed, physical systems, living systems are 
"open," t>.y virtue of their ability to capture, degrade, and dissipate free 
energy. As Schroedinger (1945) remarked, living systems "accumulate negentro- 
py ff and iri so doing maintain their structure and function. 
\ 

It is only recently that an adequate physics has developed to accommodate 
the facts of biological systems. Following the lead of Prigogine (1976; 
Prigogine & Nicolis, 1971) and Katchalsky and Curran (1967), Morowitz (1979) 
has provided argument that continuous energy flow through a living system 
constitutes its chief distinguishing feature. In order to prevent the drift 
towards static equilibrium, biological systems must perform work. Since an 
isolated, closed system cannot do steady work, it must be connected with a 
source and a sink; and it is the flow of energy from the source to the sink 
that constitutes work. Energy flow, per se^ is the chief organizing factor of 
living system3 (cf . Morowitz, 1979) . 

j All this may seem far removed from a theory of movement, but it leads us 

{. to one fundamentally important principle that follows from Morowitz's (1979) 
f main Theorem (p. 33): the flow of energy th rough the system from a source to 
a sink will lead to at least one cycle in the system . It is the notion that 
cyclicrtirprovides a dynamic basis for investigating (and understanding) motor 
systems i,hat we turn our attention to next. 
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4. CYCLICIIY , WITH SPECIAL REFERENCE TO HOMEOKINETIC PHYSICS 

(Iberall, 1977, 1978; Iberall, Soodak, & Hassler, 1978, 
Soodak & Iberall, 1978; Yates, 1980KYates & Iberall, 1973). 

Persistent cyclicity in biological systems is a non-linear phenomenon; if 
it were not, the strictures of thermodynamics would ensure a steadily decaying 
function. Consider, for example, simple mechanical systems such as a mass- 
spring, in which the equation of motion describes a trajectory towards an 
equilibrium state. Such systems may be described by a second order differen- 
tial equation as follows: 

mx + cx + kx = 0 ^ 1 ) 

In Equation 1, oscillatory motion will decay at a rate proportional to the 
magnitude of the viscous (frictional) term c. This fact is predicated upon 
the second law of thermodynamics—time flows in the direction of entropy. Yet 
as we have noted above, living systems are characterized by sustained motion 
and persistence; they are not statically stable; rather they maintain their 
form and function by virtue of their dynamic stability. How then, can we 
ensure sustained motion without violating thermodynamic law? 

Consider again the familiar mass-spring equation, but this time with a 
forcing function F(0): 

mx , * cx + kx = F(0) (2) 

It is not enough to supply energy to the system described in Equation 2. It 
must also be supplied at the right place and time in the oscillation. 
Moreover, the forcing function must exactly offset the energy lost in each 
cycle for sustained performance to occur (i.e., to satisfy thermodynamic 
strictures) . Many real systems meet this requirement by employing a 
mechanism— called an escapement— that releases exactly the energy needed to 
compensate for dissipative losses. 2 The escapement .consists of a non-linear 
element that taps energy from a high potential source— as long as it lasts— to 
overcome local thermodynamic losses. Thus a pulse or "squirt" cf energy is 
released into the system via the escapement such that, averaged over cycles, 
the left hand side of Equation 2 equals the right hand side, thereby ensuring 
sustained motion. Such cycles are called limit cycles because they are 
capable of returning lo a stable mode regardless of disturbances that may 
speed up or slow down the cycle (see below for further details of limit cycle 
properties) . 

Real clocks are non-linear limit cycle oscillators, that once started, 
have self-sustaining" properties (cf. Andranov & Chaiken, 1949). Many investi- 
gators of movement have hypothesized the existence of "clocks" or "metronomes 
for purposes of timing (see Keele , 1981, for most recent review) and the 
rhythmic structure of many biological systems is beyond question (cf. /setoff , 
1979; Oa;ley & Goodwin', 1971 ) as is the existence of social -cultural rhythms 
(e.g., Brazelton, Koslowski, & Main, 1974). However, neither in the field of 
chronobiolo&y nor in the motor systems area are such hypotheses based firmly 
in physical theory. Cyclicity (clock-like behavior) in a system arises as a 
consequence if the transfer of energy from a high potential source to a low 
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potential sink. Cyclic iiy i*s ubiquitous in all complex systems, as Yates 
(1980) has emphatically remarked, because it is an "obligatory manifestation 
of a universal design principle for autonomous systems." 

What does a physically-based theory of periodic phenomena buy us in terms 
of a principled approach to the coordination and control of movement? The 
historical antecedent to modern models of the regulation of behavior is the 
Bernard-Cannon principle of homeostasis. That is, the int: -nal states of an 
organism are preserved at "^quil ibriun despite changes in the external 
environment . Modern feedback theories (mbdeled on quasi-linear 
servocnechanisms) extend th*s notion of a reference signal-as-goal state to one 
that caa be achieved and adjusted through processes of detection, comparison, 
and error correction3 (cf. Adams, 1971, 1977; Schmidt, 1975). In sharp 
contrast, the physical scheme ttiat we outline here is horoeokinetic (Iberall, 
1970). It is the operating conditions of a configuration of interacting, non- 
linear , limit cycle oscillatory processes that determine the stability (or 
regulated state) of the organism. According to Iberall and his colleagues 
(Iberall, 1977, 1978; Iberall et al.,,1978; Yates & Iberall, 1973) stability 
in self-organizing autonomous systems (the living kind) > arises when many 
cyclical processed become entrained. Thus we can conceive of systemic 
behavior as being established by' an ensemble of non-linear components that are 
entrained itito a coherent configuration. 

Elsewhere we have reviewed and presented evidence for the nation — 
-stemming iVom Bernstein's C 1 9U7 > initial insighta — that a group of* muscles, 
functioning as a unit, exhibits properties qualitatively like that of a- non- 
linear oscillatory system (of. Asatryan & Fel'dman, 1965; Fowler et al. t 1980; 
Kelso, 1977; Kelso & Holt, 1980; Kelso, Holt, & Flatt, 1980; Kelso et al . 
J980, for review). Briefs', we have shown that limb movements may be 
terminated accurately despite unexpected changes in initial conditions, unpre- 
dictable load disturbances during the movement trajectory, functional deaffer- 
entation, and all of ^hese in combination. These results have been widely 
accepted both in animal and human work (e.g., Cooke, 1980; Polit-& Bizzi, 
1978; Schmidt, 1980) and are interpreted — to^_a first approximation — as evi- 
dence for a mass-spring system (see al so-Hollerbach, 1980, for expansion of 
this view to cursive handwriting). But linear mass-spring systems cannot 
exhibit homeokinetic properties even though they are capable gf displaying 
periodicity. That is ^to say,* the only cycles that meet the non-linear and 
self-sustaining criteria of biological. Systems are limit cycles (cf. Goodwin, 
1970; Yates, 1980; Yates £ Iberall, 1973). A brief discussion of limit cycle 
properties is thus in .order, since it is these that provide a deductive 
framework for the present experiments. 



5. PROPERTIES OF (NON-LINEAR) LIMIT CYCLES 

(cf. Minorsky, 1962;4>avlidis t 1973; Sol'berger, 1965 ) 
. «** 

To reiterate, briefly, the central feature of homeokinetic physics is the 
dynamic regulation of a system's internal degrees of freedom by means of 
coupled ensembles of limit cycle oscillatory processes. In contrast to 
program and cybernetical conceptions, homeokinetic physics views the existence 
of acti^t. , interacting components and large numbers of degrees of freedom as a 
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necessary and desirable attribute of complex systems. Homeokinetics predicts 
the discovery of numerous cyclicities and evidence for their interaction. But 
what is the nature of these cyclicities and what form does their interaction 
take? 

An important caveat at the outset (though it shall not deter us here; 
instead it is the impetus for the present work) is that the mathematical 
analysis of non-linear oscillators has hardly begun (cf. Pavlidis, 1973) • In 
contrast, for linear systems, motion in time is relatively easily described, 
even though the formula describing motion, x = f(t), can be quite complicated. 
Such functions are conceived of as a system of derivatives, from zero order 
(the position itself, x) to high order, expressed by a differentia] equation 
whose degree is given by the exponential number of the highest differential. 
In linear -systems, motion is regarded as a linearly additive system of first 
degree differentials whose coefficients may be constants or functions of t, 
but not functions of x. The family of functions described by linear 
differential equations are open to solution by various methods of integration. 

In sharp contrast, there are no general solutions for non-linear differ- 
ential equations of motion. For example, in the famous van der Pol equation: 

mx + kO-x2)N jc + bx = 0 (3) 

where x = displacement 

k = stiffness 
b = damping 
m = mass 

the stiffness coefficient k is itself a function of the dependent variable x, 
giving rise to the non-linearity and thus negating a unique solution. 
Fruitful insights into non-linear systems are obtained by graphical methods 
called phase plane techniques, which plot the first differential, velocity x 
against displace, ent x. A set of simple examples is given in Figure 1. 
Consider a linear differential equation, jc + u>2 x = 0. When integrated, the 
equation yields a set of phase ellipses of tne form (i)2 + (wx)2 = c. One 
such ellipse is shown in Figure 1A and represents the relation between 
velocity and position in a simple oscillation. The curves themselves are 
called phase plane trajectories; it is clear in Figure 1A that the phase plane 
represents a stable, periodic motion since velocity and position repeatedly 
return to a certain value (for further details, see figure caption). The 
spiral trajectory shown in Figure 1B represents an oscillation with continu- 
ously decreasing amplitude until it reaches a standstill. A spiral inwards 
constitutes a damped oscillation; if the direction were outwards (not shown), 
the oscillation would be unstable with increasing amplitude. 

The important point to realize about the phase trajectories illustrated 
in Figure 1A and 1B is that they are based on linear systems. A change in any 
parameter, such as the damping coefficient, drastically changes the form of 
the solution (and thus the phase trajectory). There is then, no preferred 
solution or set of solutions in a linear system. This is not the case in non- 
linear systems in which all trajectories tend asymptotically toward a single 
limit cycle despite quantitative changes in parameter values (see Figure 1C) . 
Thus, a highly important property .of limit cycles is their structural 
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PHASE PLANE POSIT I ON -TIME 

TRAJECTORY FUNCTION 
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Figure 1. Phase piane trajectories and conesponding position-time functions 
for three different types of oscillation. , 

A. Idealized harmonic motion 

B. Damped harmonic motion 

C. Limit cycle oscillatory motion (see text for details) 
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stability in the face of variations in parameter values. That is to say, 
limit cycles exhibit a tendency to maintain a fixed amplitude and frequency (a 
stable, orbital trajectory) no matter how perturbed (cf. Hanson, 1978; Minor- 
sky, 1962; Oatley & Goodwin, 1971; Pavlidis, 1973). Furthermore, in order for 
non-linear oscillators to offset precisely the energy lost during each cycle 
(in the drift towards equilibrium) they must degrade a large amount of free 
energy (cf. Hanson, 1978; Yates & Iberall, 1973). Because of high energy 
exchange, non-linear oscillators are quickly resettable following external 
perturbations. As we shall see, the rapid return of limit cycles to their 
preferred frequency and amplitude following experimentally imposed perturba- 
tions is a prfe-,' ainant feature of the data in the present experiments. 

As we have already hinted, and as Pavlidis (1973) further emphasizes, 
coordination in biological systems arises from cooperative relationships among 
non-linear oscillator ensembles. In sharp contrast, linear oscillators do not 
interact among themselves— a fact that is based on the superposition principle 
in the theory of oscillations (Minorsky, 1962). As defined, the superposition 
principle says that oscillation in a system of several degrees of freedom 
consists of a number of component oscillations, each independent of the other.- 

An essential property of non-linear oscillatory systems, though, is that 
they always exhibit interaction/ Perhaps the chief mo^e of cooperation among 
self-sustaining oscillators (and germane to the present experiments) is that 
of entrainment or synchronization. 1 * Apparently, the .entrainment phenomenon was 
first observed by Huygens in the 17th century (cited by Minorsky. 1962). 
Huygens noted that two clocks whose "ticks" (osc ill t. .ions) were out of step 
became synchronized when attached to a thick wooden board. Some two hundred 
years later, physicists studying electrical circuits and acdustics 
rediscovered the • synchronization ' effect. When an electrical force ol 
frequency W is applied to an electron-tube oscillator (frequency = V^) t the 
"beats" of both frequencies are apparent. As the frequencies get closer 
together the beats diminish' untiL, at a certain difference value w-w Qi theVj 
disappear entirely and a 'singT e frequency, W 1t remains. Similarly when two or 
more oscillators interact, mutual entrainment occurs (the -magnet effect of 
von Hoist. 1973) with only a small detuning of their frequencies (Minorsky. 
1962) Also if the frequency of one oscillator is an integer multiple ol 
another to which it is coupled, then subharmonic entrainment-another form of 
• mutual interaction— takes place (also called frequency d em ulti plication) . 

In sum. entrainment is an emergent property of a system of .ion-Linear 
oscillators; it is truly a self-organizing process in the sense that a 
collection of mutually entrained oscillators functions as a single unit. it 
indeed biological systems are composed of limit cycle oscillatory processes, 
then the so-called "degrees of freedom problem"— apparent m much of the 
current theorizing ornmovement control— may be minimized through recourse to 
the entrainment property of non-linear systems. Moreover . entra ^"^" SU ^ 
that the degrees of. freedom may be aissipated with maximum efficiency .and 
minimum energy cose. Let us consider how this dynamical view interfaces with 
behavioral experiments. 
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6. THE PRESENT EXPERIMENTS 



There is a., rich empirical background in support of the present 
theoretical perspective. But little of the work is on human subjects (see 
Cohen, 1970; and Schepelmann, 1979, for exceptions), and none of it is 
physically (dynamically)- founded. Rhythmical phenomena abound at all 
phylogehetic levels and in many different activities (see Delcomyn, 1980, and 
Stein, 1976 for reviews). Some of the early work on interlimb phase control 
by von Hoist during the 1930 f s is classical in this respect, although it is 
not well known ( see von Hoist, 1973, for English translation) . In his 
experiments on fish fin movements for example, von Hoist identified two main 
types of coordination. The first of these he ternfed absolute coordination — a 
1:1 correspondence between cyclicities of different structures (i.e., where 
phase and period are the 3ame) . The second — much less common interactive 
state — was called relative coordination . Here the fins exhibited different 
frequencies although at least one of the -phases usually corresponded to that 
observed during the absolute coordination state. 

Relative and absolute coordination bear a close similarity to the 
entrainment properties of non-linear oscillators 'that we have addressed in the 
previous section. 'More recently, Stein (1976, 1977) has elaborated on von 
Hoist's work using the mathematics of coupled oscillators to predict interlimb 
phase relationships based on the activity patterns of "coordinating neurons 11 
in cockroach and crayfish (see a] so Graham, 1977). As Stein ( 1 977) notes /the 
oscillator theoretic approach to neural control is in an embryonicytftate . But 
an important first step (which we attempt' to take' here) is to examine the 
qualitative predictions of the theory. 

For present purposes, there are two *>mpirical antecedents to the 
following set of experiments . The first is the finding, discussed earlier,, 
that muscles acting at a joint exhibit .properties that, to a first 
approximation, are qualitatively' like a mass^-spring system. The important 
point to realize is that such a system ?s intrinsically rhythmic or cyclic 
even though it does not have to behave rhythmically or cyclically. Thus, 
depending on its parameterization,, a mass-spring system may or may not 
oscillate. Discrete and cyclical behaviors may therefore arise as different 
manifestations of the same underlyin g organization (cf. Fel'dman, 1 966 ; Fowler 
et al., 1980; Hollerbach, 1980; Kelso & Holt, 1980). The present experiments 
are continuous with this theme and examine cyclical movements per se. 



The second antecedent for the current work comes from earlier studies of 
human interlimb ' coordination (cf. Kelso, Southard, & Goodman, 1979a, 1979b). 
When subjects perform movements of the two limbs to different sized targets at 
different distances from a home position, they do so almost simultaneously. 
Moreover, the limbs reach peak 'velocity and peak acceleration at practically 
the same time during the movements (see also Martei iuk & MacKenzie, 1980). If 
subjects are. required to move both limbs tc separate targets, but one limb 
must hurcJle an obstruction on the way, the other unobstructed hand describes a 
similar ?rc — at least oh the first few trials (Kelso, Putnam, & Goodman, Note 
1). When simultaneity is disrupted by instructing the subject to strike one 
target before tne other, there is a sizable cost (either in initiation time or 
movement errors; compared to temporally compatible, simultaneous conditions 
(Goodman & Kelso, Note 2; Keiso et al . , 1979a; see also Klapp, 1979). 




The ubiquity of . tim-ihg constraints in movement is consistent with the 
clock-like behavior of mutually entrained oscillators (see Keele , 1981, for 
further examples, but a different interpretation), which, as we have 
emphasized, has a firm dynamic basis. 5 It remains now for us to examine (and 
to illustrate) in a more direct manner some of the interlimb relationships 
predicted by the properties of limit cycles. One of the dominant techniques 
used to establish the autonomy of rhythms is to perturb the system away from 
its steady state behavior and observe the manner in which the system 
reorganizes itself (for example, this procedure is paradigmatic of circadian 
rhythm research, see Menaker , 1976). Only a system of non-linear oscillators 
will exhibit maintenance of frequency and amplitude (i.e., a bounded phase 
trajectory or limit cycle, see Figure 1C) despite perturbations. Thus in the 
first four experiments reported here (Series A), we examine the foregoing 
prediction in a simple way. As the fingers' of the two hands perform cyclical 
movements, one or" another is unexpectedly perturbed by tne injection of a 
brief torque load or a constantly applied load supplied by D.C. torque motors. 
Our interest is to define the qualitative features of the system's response to 
perturbations as revealed in phase / amplitude , and frequency characteristics 
^ of each hand . * - 

The, remaining aspects of non-linear oscillatory systems that we shall 
Dursue here focus on resonance and entrainment properties. First, non-linear . 
' oscillators (unlxke linear ones) do not exhibit resonance-an amplitude 
increase-when driven at their preferred frequency. As noted by Iberall and 
McCulloch (1969). a fundamental observation about human motor behavior is that 
externally it does not appear to have a strong metric, but interally it must 
have. Complex humans operating homeoki net ic ally are self-timed and not tied 
(in any stimulus-response manner) to external- cue constraints. Thus the 
stability, properties of limit cycles predict that their orbit should remain 
fixed over a fairly wide range of driving frequencies. Second, non-linear 
oscillators that, are coupled together will exhibit (a) mutual entrainment if 
allowed to function at their preferred frequency and (b) subharmomc (. small- 
integer) entrainment if. one is driven at a different rate from the other. As 
we- shall see. the data from the remaining experiments (Series B «nd I C) are a 
testimony to the powerful entrainment feature of non-linear (limit cycle) 
oscillators. 

GENERAL METHODS AND PR OCEDURE 

The apparatus used in all the experiments to be reported here consisted 
of a finger positioning device and associated programming electronics the 
details of which are described in an earlier paper (Kelso & Holt 1980). Q 
Essentially the apparatus . consisted of two freely rotating supports that 
allowed flexion and extension of the index fingers about the 
metacarpophalangeal joint in the horizontal plane. Situated above the center 
of rotation of each -support were programmable D.C. torque motors. An 
electronics control package permitted programming of torque motor output with 
respect to movement of the finger-in either direction. Thus, in , some of the 
experiments ( 1 to 4 ; Series A) perturbations of either digit could be appl ied 
for a short duration (termed "brief .load") or for a prolonged period 
("constant load"). Potentiometers mounted over the axes of motion provided 
analog signals from which digital representations were obtained via analog-to- 
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digital conversion at a sampling rate of 200 Hz. Additional onset and offset 
of load perturbation (Series A) and a metronome timing signal (Series C) were 
specified by analog impul ses . A package sof conversion routines — implemented 
on a PDP-11/45 computer — was us'ed to convert the digital signals and display 
them as time-domain displacement tracings. Additional programs provided mean 
amplitude, frequency ,' duration and phase information (cf. Goodman, Rubin, & 
Kelso, Note 3)- . . 

On entering the laboratory, subjects were familiarized with the movement 
apparatus and seated in a dental chair so that both arms and hands could be 
comfortably secured in the positioning device . The procedures were described 
to the subject and any questions answered. The task throughout all 
experiments was to move the index fingers of one or both hands in a cyclical 
( flexion-extension) manner from the onset of an auditory start signal to a 
stop signal. For the two-handed task, the movements were always symmetrical, 
i.e., flexion (extension) of one hand was accompanied by flexion (extension) 
of the other. Instructions in all the experiments were to move the finger(s) 
continuously over a 10 sec trial in a way that felt most comfortable and 
required least effort. 

*\ , A. PERTURBATION EXPERIMENTS 0 THROUGH 4 ) 

Specific Methods and Procedures 

The subjects were right-handed male and female volunteers who were paid 
for their services. Two groups of subjects were used — a single-hand group 
(N=6), that completed the experiments using the index finger of only one hand, 
and a bimanual group (N=6) that used the index fingers of both the left and 
right hand simultaneously. In the bimanual group, trials were randomized sq 
that the subject received either nc^ perturbation , a perturbation of the left 
index fingep, or a perturbation of the right index finger. Four trials were 
given in each of the three conditions, a total of 12 trials for each 
experiment. In the single-hand group, subjects were told which finger to move 
and a given trial was either perturbed or not perturbed. 6 

In Experiments 1 and 2, clamps were placed on "the protractor arm of the 
apparatus, the purpose of which was to constrain movements to a 50 degree 
range (40-90 degree extension). In Experiment 1, the perturbing torque was 
set v at 50% of maximun torque output available (40.8 oz-in) for a duration of 
,100 msec, and was int» oduced approximately midway through the 10 sec trial. 
The perturbation was injected in a flexion direction as the finger moved 
througii the extension phase of the cycle. In Experiment 2, the torque load 
was reduced to 25? of m^ximim (20.4 oz-in) and was introduced about half way 
through the trial and maintained until completion of the trial. The constant 
torque load was applied in the directi6n of limb flexion* Thus, during 
extension of the index finger the load oppo'sed movement; during finger flexion 
the load acted in the same direction. 

Experiments 3 and 4 were replications of 1 and 2 with the exception that 
the constraints were removed, allowing subjects to select both amplitude and 
frequency of movement. 
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Data Analysis 



Meajurements of movement amplitude and frequency were obtained from 
converted digital signals. This .procedure relied upon a computer-based 
determination of displacement maxima and minima per cycle in order to arrive t 
at cycle amplitude. Mean values of amplitude and frequency were obtained only 
for those cycles before and after the perturbation cycle, and appropriate 
comparisons made using paired t-tests. 

To examine the phasic relationships between the two limbs (in the 
bimanual group), tvx> analyses were employed. The first involved a computer 
generated display in which the displacement tracings were superimposed such 
that any phasic lag or difference between them could be immed iately observed . 
For example, the lower tracing on Figure 2 shows an almost perfect 
superimpcsition of the upper individual tracings from each individual hand. 

■'In order to quantify phasic relationships, a second method compared the 
two traces using cross-correlation techniques. The cross-correlation function 
for two sets of time-domain data describes the general " de^ndence of the 
values of one set of data on another. For example, if u(t) corresponds to the 
signal for the right index finger and v(t + t ) corresponds to the signal for 
the left index finger delayed by the interval t(tau), an estimate of the 
cross-correlation function for a given tau may be obtained by taking the 
aver-ge product of the two values over the observation time T. The resulting 
average product will approach' an exact cross-correlation as T approaches 
Infinity. In this way the actual phase lag or tau between the waveforms of 
each hand and the correlation at that lag were calculated for all cycles 
before and after the perturbation on each <vrial . 

Results \ 

(a) Frequency between and within Umbs: The pre- and post- perturbation 
analysis for the frequeftcy data in the bimanual group across all four 
experiments is shown in Table 1. Clearly, the results of this analysis point 
to a very tight periodic relationship between pre- and post-perturbation 
cycles within each limb. It may be noted that the frequency changes following 
a brief perturbation range from . 0 to .03 cycles per sec. Similarly, the 
frequency of a limb that subsequently ' receives a constant load remains 
virtually unchanged (0 to .05 cycles per sec). None of the pre- versus post 
differences in frequency attained statistical significance (j> > .05). 

In the single, hand experiments, it can be seen again that frequency 
changes following brief perturbation and the addition of a constant load are 
small, although they tend to be a little higher than in the bimanual case (.01 
to .07 cycles per second, see Table 2). Again, none of the differences 
attained statistical significance ( p > .05), nor is there any systematic 
directional bias in the small frequency changes that are observed. 
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CONSTRAINED CYCLICAL MOVEMENTS 
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Figure 2. Two-handed cyclical movements between physical constraints. The 
right index finger was deflected in the direction of flexion as it 
was extended. In the lower tracing, the two upper tracings are 
superimposed. The almost perfect overlap reflects- the tight period 
and phase linkage between t&e fingers. It is interesting to note 
that the two fingers regained their phasic relationship within the 
same trial as the perturbation occurred. 
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Table 1 

Means and Standard Deviations v of Frequency for 
Cyclical hbvements of Two Hands 

Experiment 1 : Amplitude constrained, brief perturbation 

PRE-PERT POST-PERT 





Condition 


Na 


Meanb. 


SD 


Mean 


SD 


t 


Right perturbed 


: right hand' 
left hand 


23 
23 


1.39 

1.49 
* 


O.^o 
0.56 


1 . M 
1.46 


* 0.44 
0.49 


-0.63 
0.84 


Left perturbed: 


right hand 
left hand 


25 
25 


1.45 
1.45 


0.50 
0.47 


1.43 
1.47 


0.47 
0.47 


1 .98 
-1 .26 


F xnpri mpnt" ?! 

L< A pC 1 llllCll U C • 


Amplitude constrained. 


constant perturbation 






Right perturbed 

> 


; right hand 
left hand 


22 
22 


1.37 
1.33 


0.29 
0.30 


1 .34 
1 .28 


0.27 
0.27 


1 . 12 
1 .72 


Left perturbed: 


right hand 
left hand 


25 
25 


1 .25 
1 .25 


0.31 
0.29 


1 .24 
1 .22 


.0.29 

U . cO 


0.58 

1 73 


Experiment 3- 


Free, brief perturbation 










Right perturbed 


: right hand 
left hartf 


19 

19 . 


1 .68 
1 .69 


0. 18 
0.20 


1.69 
1.71 


0. 16 
0. 16 


-0.78 
-0.53 


Left pertUrbed: 


right hand 
left *iand 


19 
19 


1 .72 
1 .72 


• 0. 19 
0. 17 


1 .74 
1 .73 


0. 18 
0. 14 


-1.27 
-0.29 


Experiment 4: 


Free, constant 


load 












Right perturbed 


: right hand 
left hand 


19 
19 


1.79 
1 .79 


0.19 
0. 18 


1 .78 
1.79 


0. 13 
0. 13 


0.27 
0.25 


Left perturbed : 


right hand 
left hand, 


19 
19 


1 .82 
1 .81 


0. 16 
0. 16 


1.84 
1.82 


0. 19 
0.21 


-0.68 
-0. 19 



1 Number of data pairs 

bFrequency in Hertz (cycles per second) 
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Table 2 



Means and Standard Deviations of Frequency for 
Cyclical Movements of One Hand 











PRE- 


-PERT 


POST- 


-PERT 




FREQUENCya 




CONDITION 


Nb 


Mean 


SD 


Mean 


SD 


t 


'Expt. 


1: 


Lef.; hand alone 


18 


1.56 


G-. 43 


1.53 


0.39 


1.52 






"Right hand alone 


17 


1.48 


0.38 


1.49 


0.47 


0.35 


Expt. 


2: 


Left hand alone 


18 


1.57 


0.48 


1.56 


0.48 


0.67 






Right hand alone 


17 


1.58 


0'.43 


1.49 


0.31 


1.83 


Expt. 


3: 


Left hand alone 


19 


1.82 


0.52 


1.77 


0.54 


2.05 






Right hand alone 


18 


1.90 


0.80 


1.88 


0.51 


0.66 


Expt. 


4i 


Left hand alone 


19 


1.88 


0.57 


1.95 


0.63 


-2.93 






Right hand alone 


19 


2.01 


0.50 


1.99 


0.55 


0.44 



a In Hertz 

^Number of data oairs 



(b) Phase relationships b etween limbs : The tight periodic relationship 
in the pre- and post-perturbation cycles within a limb is matched by a tight 
periodic and phase relationship between the hands. In Figures 2, 3# and U 
exanples of the superimposed displacement curves for each experiment are 
shown. It is clear that despi e the imposition of different perturbations, a 
tight phasic link between the two fingers is preserved. Quantification of 
these data using cross-correlation techniques " verified the extremely close 
period and phase of the two fingers. The mean tau values and the correlations 
at those values averaged across trials for each subject are shown in Tables 3 
and U. Absolute values of phase lag (t) both pre- and post-perturbation are 
small, and correlations (with only a- few exceptions) are extremely high. 

i 

(c) Amplitudes between and within limbs : Analysis of 'the amplitude data 
for the bimanual and single-handed groups is provided in Tables 5 and 6, 
respectively. In general, amplitudes tended to remain constant with a couple 
of anomalous findings (se*, for example, Experiment 4, Table 5. and 
Experiments 1 and 2, Table 6). 
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FREE CYCLICAL MOVEMENTS 




TIME (SECONDS) 



Figure 3. Two-handed cyclical movements without constraints on movement 
amplitude. Note that the right hand showed a greater range of 
flexion-extension than the left, with a greater ''velocity. Phase 
and period of the two hands are again tightly limited and 
reentrairfed within the same cycle after brief perturbation of the 
right irjdex. / 
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FREE CYCLICAL MOVEMENTS 



I RIGHT i:\V£.l 
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ure j4. Two-handed* cyclical movements without constraints on movement 
"amplitude. A constant load was added to the left index' finger in 
the! direction of flexion, while the subject moved in extension. 
The result on the first cycle was a dramatic cdecrease in velocity 
and amplitude. In the remaining cycles, compensation occurred such 
that not only were phase and period relationships maintained, but 
also the kinematics — velocity and .amplitude. 



Table 3 



Cross-correlation Analysis of Pre-Perturbation ^- 
Cycles (Two Hands) 



EXPERIMENT 1 



EXPERIMENT 2 



Correlation 



TALK msec) a 



Correlation 



TAU(msec) 



Subject 


Mean 


SD , 


Mean ' SD 


Mean 


SD 


1 


.94 


.17 


5.71 3.20 


.95 


.02 


2 


.97 


.02 


11.43 4.40 , 


.98 


.01 


J 


.99 


.01 


10.00 9. 5S 


,.98 


.01 


4 


.98 


.01 


14.00 . 4.90 


.97 


.01 


5 


.97 . 


.01 


12.50 8.54 


.97 


.01 


6 


.97 


.04 


5.00 .5.35 




.05 






i 

i 

EXPERIMENT 3 


C 


1 

EXPERIMENT 




Correlation 


_TAU(msecV- s 


Correlation 


Subject 


Mean 


SD , 


Mean SD 

5 


Mean 


SD 


.1, 


.98 




6.25 4 . 84 


.95 


, .04 


2 


.99 


.01 


5.80 3.21 


.95 


.02 


3 


.98 


.01 


5.63 6.34 


.91 


.07 


4 


.97 


.01 


12.50' 2.50 


.96 


.02 


5 


.96 


.oi 


11.43 6.93 


.96 


.03 


6 


.94 


.04 

i 


14.38 17.40 


.94 


.04 

s 


i 

I 

aphase lag with resolution 


to 5 msec 




1 



Mean 

4.29 
0.00 

10.63 
13.75 
12.50 

13.75 



SD 

3-19 
0.00 
8.08 
6.00 
6.12 
6.9-3 



TALK msec) 



Mean 

15.00 
17.05 
15.63 
13.13 
11.50 
16.43 



SD 

11.02 
2.50 

10.74 
4.29 
9.27 

13.02 
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Table *J 



Cross-correlation Analysis of Post-Perturbation 
Cycles (Two Hands) 



EXPERIMENT 1 j EXPERIMENT 2 



Correlation 



? 



TAU(msec)a 



Correlation 



TAU(msec) 



Subject 

1 

2 

3 
4 
5 



Mean I 

.93 
.97 1 
.99 
.98 
.98 



SD 


Mean 


SD 




Mean 


SD 


Mean 


SD- 


.04 


8.57 


5.80 




.87 


.06 


4.29 


6.39 


.01 


C.43 


3.50 


V 


« 








.01 


10.71 


4.95 




.89 


.06 


11.25 


7.4C 


.00 


' 23.00 


9.80 




.87 


.04 > 


43.50 


-.0.31 


.02 


7.50 


5.59 




.84 


.05 ' 


15.63 


9.50 



EXPERIMENT 3 



EXPERIMENT 4 



Correlation 



TAU(msec) 



Correlation 



TAU(msec) 



Subject 


Mean 


SD 


Mean 


SD 


' Mean 


SD 


Mean 


1 


.97 


.02 


_.13.75 


8.93 


- .84 


.06 


10.71 


2 


.99 


.00 


/ 7.36 


2.61 


« 






3 


.98 


.01 


4.38 


4.64 


.72 


.07 


22.50 


4 • 


.96 


..17 


14.38 


7.26 


.76 


.26 


10.00 


5 


.96 


.02 


14.29 


11.16 


.89 


.06 


21.88 


6 


.90 


.09 


21.88 


20.76 


.89 


.05 


10.29 



SD" 

6.78 

13.46 
8.66 
13.21 
10\83 



•insufficient data 



aphase lag with *"esolution to 5 ni'jec 
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Table 5 • 

Means and -Standard Deviations of Amplitude for 
Cyclical Movements of Two Hands 

Experiment 1: Amplitude constrained, brief perturbation 



PRE-PERTb 



POST-PERT 





Condition 


Na 


Mean 


SD 


Mean 


SD . 


■ t 




Right perturbed 


: right hand 
left hand 


23 
23 


50.92 
50; 80 


2.21 
•2.16 


51.50 
18.81 


1.81- 
5.89 


-0. 
1 . 


78 
78 


Left perturbed : 


right hand 
left hand 


25 
25 


51.16 
5K93 


2.31 
1.80 


50.27 
51.31 


5.17 
3.02 


0. 
1. 


86 
16 




Amplitude constrained, 


constant perturbation 








Right perturbed 


: right, hand 
left hand 


22 
22 


52 . 27' 
52.61 


3-31 
6.21 


51.27 
51.21 


1.13 
3. 39 


1 . 
1 . 


62 
03 


Left perturbed : 


right hand 
left hand 


25 
25 


51.31 
51.29 


1.77 
1 .81 


18. 51 
19.92 


8.08 
1.35 


1 . 
1 . 


69 
52 


Experiment 3» 


Free, brief perturbation 












Right perturbed 


right hand 
left hand 


19 
19 


69.76 
66.29 


16.38 
15.19 


67.38 
6U.58 


18.27 
11.35 


1 . 
1 . 


28 
31 


Left perturbed : 


right hand 
left hand 


19 
19 


65.25 
70.52 


11.82 
20.70 


66.35 
70.27 


16.12 
21 .86 


-0. 
0. 


98 
20 


Experiment 


Free , constant 


load 














Right perturbed 


: right hand 
- left hand 


19 
19 


71.77 
67.15 


25.13 
16.50 


75.91 
68.00 


30. U9 
13.79 


-2. 
-0. 


26 * 


Left perturbed : 


right hand 
left hand 


19 
19 


65 • t0 
61.50 


18.01 
18,50 


71.05 
60.72 


20.38 
21 .28 


-1. 
-1 . 


nn«» 

98 



aNumber of data pairs 
b Amplitude in degrees 



«», 



< .05 
.01 
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Table 6 



V 



Means and Standard Deviations of Amplitude for 







Cyclical 


Movements 


of One 


Hand*' 








AMPLITUDE^ 


CONDITION 


N 


Mean 


SD 


Mean 


SD 


t 


Expt. 


1: 


Left hand alone 
Right hand alone 


18 
17 


50.56 
55.05 


2.73 
3.70 


50.71 
56.77 


2.86 
5.8B 


-1.53 
-2.88* 


Expt. 


2: 


Left hand alone 
Right hand alone 


18 
17 


51.68 
57.96 


2.71 
4.71 


52.25 
58.80 


2.35 
5.70 


-3.35* 
-1.15 


Expt. 


3: 


Left hand alone 
Right hand alone 


19 
18 


60. 13 
63. 10 


24.33 
25.29 


61.15 
60.49 


23.63 
21.94 


-0.81 
1.59 


' Expt. 


4: 


Left hand alone 
Right hand alone 


19 
19 


58. 43 
53.88 


22.75 
12.95 


60.35 
51.46 


13.53 
13.66 


-0.60 
1.51 



a in degrees 



•p < .05 



However, in several of the analyses of free bimanual movements, 
(unconstrained), it is clear that the amplitude differences almost reached 
significance and, given .greater power, may have^ done so. Of course the 
foregoing analysis is quite global in the sense 1 that it clouds potential 
anplitfjde adjustments that may have occurred on individual cycles before or 
after perturbations. With this in mind, we compared amplitudes of individual 
cycles in Experiments 3 and ^ (single and bimanual groups) on those trials in 
which at least five cycles preceded, and followed the perturbed cycle (P). 
Table 7 provides a summary of those cyclers reaching significance ( p < .05). 
Out of a possible number of 28 significant differences within each cell, it 
can be seen that the maximum number to reach significance is only eight. 
Furthermore, there does not appear to be any systematic pattern among those 
cycles that is statistically different from any other. Thus, if subjects were 
adjusting anplitude over several cycles following the perturbation t we might 
have expected a larger number of differences between those cycles immediately 
preceding perturbation onset (P-1, P-L) and those , cycles .immediately following 
the perturbation (P+1, P+2). Although there was a greater number of pre- post 
differences in amplitude overall, there was nothing' to suggest— even in the 
individual anplitude data—any progressive recalibration on the first or 
second post-perturbation cycle back to . pre-perturbation values. 
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Table 7 

t-test Matrix of Total Number of Significant 

Differences- Between Individual Cycles. All , 
• Subjects on Expts. 3, 4 One and Two Hands. 

Comparison pre-pert cycles Comparison post-pert cycles 

p _5 P_4 P_ 3 P_2 P-1 3 , p +5 P+1 p +3 p +2 p +1 b 

p _5 3 : 2 1 0 P+5 0 3 2 2 

P_U ' 1 0 2 P+U 2 3 3 

P_ 3 11 p +3 4 3 

P..2 ' *0 P+2 « 



5 



Comparison pre-post 

p + 1 P +2 P+3 P+U P+5 

p -5 3 3 5 5 3 

P _H 5 * U 4 3 2 

p _ 3 3 5 5 3 2 

p _2 2 5 6 U 1 

P_1 i» 6 U 5 1 

a P_N = N th cycle preceding perturbation 
bp +N = N th cycle following perturbation 
CTotal- possible significant differences in each cell = 28 
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Discussion 



The finding that frequency (period) within a finger remains constant ^ 
despite the addition of brief and constant loads strongly supports the view 
that the limbs behave in a manner qualitatively like limit cycle oscillators. 
The fir-st criterion for limit cycle processes — that frequency (period) and 
amplitude tend to be maintained despite perturbations — receives good support 
in our data. Further support for the limit cycle account is evident in the 
tight phase and period relationships between the fingers of the two hands. 



Perhaps the crucial test of the * entrainment property of non-linear 
oscillators is the perturbation experiment. It is clear in all four 
experiments reported here that, quickly after the perturbation of one finger, 
both fingers become re-entrained within one or two cycles; there is neither a 
phase lag nor a difference in periodicity between the two fingers. This 
result is reminiscent Qf work in animal locomotion. Thus Shik and Orlovskii 
(1965) temporarily prevented one of the limbs of a locomoting cat from 
initiating the transfer phase of its step cycle — a procedure that necessarily 
disrupted the phase relations among oil four limbs. Within a few full cycles, * 
however, the limbs returned to their previously established phase 
relationships, as in the present experiments. 

r 

The general finding that amplitude tends to be preserved in both limbs in 
spite of load perturbations applied to* one, provides additional support for 
the limit cycle oscillator view and extends previous work on single trajectory 
movements (e.g., Fel'dman, 1966a, 1966b; Kelso & Holt, 1980; Polit & Bizzi, 
1973) to voluntary cyclical movements (see also Fel'dman, 1980). A noteworthy 
feature of the data is the kinematic differences within and between limb* when 
the system is perturbed. For example, in Figure 4, even though frequency is 
maintained in the constant load condition, velocity (as reflected in slope 
differences) and amplitude differ for the two limbs. This finding draws our 
attention to a fundamental point that we have made in earlier papers; observed 
kinematic details are consequences of the system's dynamics (e.g., mass,* 
* stiffness, damping) and are determined by those dynamics, (cf . Fowler et al., 
1980; Kelso & Holt, 1980). 



Thece are two additional properties of non-linear, limit cycle 
oscillators that we shall consider under the heading of entrainment. The 
first is that when coupled, oscillators of slightly different frequencies will 
tend to entrain at some intermediate frequency. As von Hoist (1937/1973) 
remarked, the striking feature of coordinated movements is their "accordance 
in tempo." He called this frequency detuning or mutual entrainment property 
"the magnet effect": simply a tendency of one rhythm to impose i' s tempo on 
another. A second 'form of mutual interaction among oscillators occurs if the 
frequency of one is an integer multiple of another to which it is coupled, a. 
property termed subharmonic entrainment or frequency demul tipl ication . In the 
following experimental demonstrations, both types of oscillatory interaction, 
mutual and subharmonic entrainment, are clearly evident . 




B. FURTHER DEMONSTRATIONS OF ENTRAINMENT 
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1. Mutual Entrainment . 



The basic tack on this issue was- first to determine the. preferred 
frequency of each limb In isolation * and then "examine possible interactions, 
between' the limbs when they perform together. The procedure was similar to 
that employed in the previous experiments. Each of six, right-handed subjects 
(none of whom participated in any of the previous studies) completed four 
trials with the left hand only, the right hand only and both hands combined. 
The twelve tria*s7>ach lasting 10 sec, were randomized for all subjects. The 
subject was ifstryfcted to move the finger(s) cyclically at b frequency and 
amplitude thatVCelt most comfortable. No constraints or perturbations were 
imposed at anytime i^d-ortng^ the trial. 

^^a K / amplitude and frequency data for each trial were again obtained from 
the converted digital signals. The means for all subjects are shown in Table 
8% The frequency data meet the predictions of -limit cycle oscillators in that 
the left hand is "attracted" to the right hand, which in turn shows only a 
very small and statistically insignificant frequenc modulation. Clearly the 
overall effect is modest, as one would expect in an experiment using simple 
cyclical movements of the two hands. There are obvious ways to amplify the 
extent to which one limb fmposes its rhythm on another, by changing, for 
example, limb dynamics (e.g., mass, lever arm) or by fatiguing one limb or the 
other. Our intent here, however, has been simply to demonstrate the mutual 
entrainment effect. That the right hand does exert an. "attracting force" on 
the leflT in right-handed subjects illustrates, another mode of cooperation 
among oscillators. This finding is also a testimony to the difficulty 
subjects have in performing two different rhythms at the same frime. 
Entrainment, we suspect, represents a major limitation on what activities can 
■actually be performed. As von Hoist (1937/1973. 1939/1973) remarked, it 
appears to be an important principle of central order. Yet entrainment has 
received little or no attention in theories of how skills are acquired, or how 
movements are controlled. 



Table 8 

Means and Standard Deviations of .Preferred Frequency 
and Amplitudes in One-handed and Two-handed Cases (N=6) 







One 


hand 




Two hands 








Left 


Right 


Left 


Right 


Combined 


FREQUENC Ya 


Mean 
SD 


1.973 
0.03 


2.001 
0.O5 


1.996 
0.03 


1.995 
0.03 


1.996 
0.03 


AMPLITUDEt 


Mean 
SD 


62.95 
4.31 


66.91 
3.59 


57.57 
9.17 


65.84 
3.84 


61.71 
4.01 



ain Hertz 
bin degrees 
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2. Subharmonio Entrainment 

There is at least one situation in which individuals do not experience 
difficulty in performing separate rhythms with the hands, and that is when 
they both share a common time base. Recently Klapp (1979) has shown 
"interference" between the two hands in a repetitive key pressing task when 
the periodicity of each is different, as measured by the -subject's ability to 
match a pacing tone. When the periodicities share common timing, even though 
shifted in phase, no such interference occurs. These results may be accounted 
for by another property of oscillator interaction, namely, low integer 
sub harmonic entrainment. This form of entrainment is such -an overwhelming 
phenomenon in human motor activity that we simply illustrate it here (see 
Figure 5).- Subjects were asked to move one finger at their preferred 
frequency and to move the j^ther finger at a different frequency. We show two 
examples of one subject's performance. As illustrated in Figure 5. the 
oscillation of the limb moving at slower frequency exactly coincides with the 
appropriate oscillation at the faster frequency-r-one is a simple (2:1, 3:2)o 
subharmonic of the other. 7 The example also illustrates the interesting 
property of amplitude modulation (von. Hoist's superimposj tion effect). Thus, 
on some coinciding cycles, a "beat" phenomenon can be observed (particularly 
in the 2:1 ratio; in which the amplitude of the higher frequency hand 
increases on the ? occasion that the lower frequency oscillation takes place. 
Note also* that 'in the lower frequency oscillation the finger is nev^r 
completely still. Although its amplitude is much smaller, it is clear that 
there is a small oscillation, especially in the 2:1 ratio condi^on. In 
effect, both fingers are cycling at the same frequency— only the amplitude or 
force distribution to each finger varies. < The foregoing result fits, nicely 
with recent work by Shaffer (1980) on highly skilled pianists. Shaffer 3^owe^ 
that the pianist's right hand carrying the melody plays with more "weight" 
than the -left,- and that gradual and sudden changes in both hands -can be made 
without disrupting timing. It is tempting to suppose that Shaffer's pianist 
is displaying (admittedly in a more refined way) the basic superimposition 
principle for combining the outputs of coupled oscillators (see also 
Gallistel, 1980, and Kelso, 1981, for additional examples). 

It is worth noting that the entrainment properties demonstrated above are 
net restricted to movements of the fingers but are also apparent in systems 
that share little or no anatomical similarity. In an analysis of the 
interrelationships "between speaking and manual activity (Kelso , Tuller , & 
Harris, in press), we have shown that subjects, when asked to repeat a simple 
syllable (the word "stook") at a different rate from their preferred finger 
rate, do so by employing low integer subharmonics. The situation is reversed 
(though not necessarily symmetrically) when subjects are instructed to move 
their -hands at a rate different from their preferred speaking rate. Again, 
the ratios ohosen are always simple ones (e.g., 2:1). We interpret these 
preferred relationships as emergent characteristics of a non-linear ocillator 
ensemble; the collection of entrained oscillators functions in a single 
unitary way. Entrainment therefore ensures a stable resolution of 
Simultaneous temporal processes throughout the whole system. Moreover, the 
form of entrainment is limited to a relatively restricted range of preferred 
relationsnips— a feature captured in Iberall and McCulloch's ( 1 969 ) phrase as 
an "orbital constellation." 
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TIME (SECONDS) 



Figure 5. An example of /a subject's response to instructions to move the 
fingers at different frequencies. Beats in the lower frequency 
hand tended to coincide with the beat^s of the higher frequency 
hand. On so(ne coinciding cycles, a "beat" phenomenon can be 
observed in /which the amplitude of the higher freqi— .cy hand 
increases in/ relation to non-coincident cycles (see especially the 
2:1 ratio). ' In addition, slight movement during the "quiet" phase 
of the low frequency rhythm is indicative of the hignly constrained 
nature of t'he two-hand linkage. 
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C. NON-RESONANCE PROPERTIES 

A characteristic of a linear oscillator, but not of the non-linear limit 
cycle oscillator ' under examinatior^here t is that when driven at its 
fundamental frequency it will dii/spH^^isonance — a behavior that results in an 
increase in amplitude of oscillation. A final experiment examined whether' 
amplitude changes would be observed when the limb was driven by an external 
rhythmic source at its preferred frequency, and at other frequencies higher 
and lower than the preferred frequency. 

Four paid volunteers, none of whom had participated in the earlier work, 
took part in this study. The procedures were veVy similar to those already 
discussed, with the following exceptions. Before the experiment proper, the 
preferred frequency for each individual subject was obtained by .finding the 
mean and standard deviation of five trials (10 sec each) in which the subject 
chose his/her rate and amplitude. Our prevous work indicated that this was 
more than enough time to achieve stable measures. Thtse data were then used 
as the basis for driving frequency manipulations effected via a visual 
metronome. Seven frequency conditions were used (F 0f the subject's preferred 
frequency F Q + 2 SDf Fo + 4 SD, Fo + 6 SD). For example, if a subject had a 
measured preferrecTTrequency o? 1.5 Hz wITh a standard deviation of 0.075^2, 
he/she would be asked to produce one flexion-extension cycle to each metronome 
beat under the following conditions: 1.5 Hz f 1.5 + .15 Hz. f .30 Hz. f 

1.5 + .45 Hz. Five trials were given in each of the seven conditions that 
were randomized for each subject, and movements were two-handed in all cases. 

The mean amplitude and frequency are displayed in Table 9 for all seven 
driving frequency conditions. These dat^ were analyzed in a 2 x 7 x 5 (hands 
[left/right] x driving frequency x trials) within, subjects analysis ' of 
variance. None of the amplitude comparisons was statistically significant, 
but there were definite effects *on frequency in response to the driving 
stimulus conditions, F(6, 18) = 19.28, j> < .01. None of 'the subjects had any _ 
difficulty performing the task in any of the driving conditions, as confirmed 
by scanning the graphical output of metronome and' displacement waveforms. ,0ne 
such example illustrating a fixed maintenance of amplitude across the most 
extreme driving conditions is shown in Figure 6. 

Nevertheless it is apparent from the mean data that there are tendencies * 
for amplitude and frequency to be linearly related particularly at faster 
driving frequencies. Paradoxically, and if we were dealing with linear 
oscillators, amplitude should increase with slower driving frequencies — a 
prediction not borne out by the present data. 

In short, the responses of the limbs to different driving frequencies 
seem to display both linear and non-linear characteristics. This is not 
particularly surprising for non-linear systems are capable of exhibiting ' 
linear behavior over a range of parameter values. The distinguishing feature 
of non-linear oscillators is that their behavior can be dramatically altered 
in terms of phase and amplitude when driven at certain frequencies. Much more 
needs to be done to determine if, and under what conditions, amplitude (and/or 
phase) changes occur as "saltatory jujips" at certain frequencies; that is, 
when one stable orbit is forsaken for another. 8 
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Table 9 N . 

* «* 
Means and Standard* Deviations of Frequency and Amplitude 
of Two Hands When Driven b/ Light Pulses at 
Different Frequencies (N=6) 

^ . FOc FO+2 SD FO-2 SD FO+-4 SD FCM4 SD FO+6 SD FO-6 SD 

FREQUENCYa. Mean 1.82 1.92 1.74 2.U ' 1.61* 2.34 .* 1.53 
Left 

' SD 0.02 0.02 0.03 0.02 0.03 0.03 0.06 

Mean 1.82 1.92 1.71 2.11 1.61 2.33" 1.55 

Right 

SD. 0.03 0.02 0.01 0.03 0.01 0.03 * 0.04 

AMPLITUDE* -Mean 44.69 48.4"0 47 . 06 40.2 9 4 0 . 85 37 . 28 44.17 
Left 

- « 

' SD J S-J55 5.03 3.64 1.75 3.35 2.54v 4.43 

Mean 52.38 49.98 50.84 45.55 46.25 40.10 49.71 

Right 

SD 3.01 • 5.86 3.18 2.41 3.04 1.66 4.44 

# 

a in Hertz 

/- 

D in degrees 

^reflects mean preferred frequency 
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Tracings indicate a constant ampl itudein the two hands when driven 
at different frequencies by a visual* metronome Ca pulsing LED). 
The onset of a light pulse in most cases was synchronized with, the 
point of maximum extension or flexion. Slight deviation's of the 
finger oscillations from the metronome period can be observed in 
some cycles. 
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The point that should not be lost here is that ospillatory systems can 
behave in a linear manner yet .still be .non-linear in nature. Consider the 
intuitive example of dolphin locomotion. At low speeds, .dolphins cruise w^th 
little energy .cost because o.f their streamlined sffape.; wa^er flow over their 
body surface is laminar, AtTugher swimming speeds, however* the creation of * 
turbulence can increase energy cpsts by an order of magnitude (see Brookhart & 
Stein ,^1980). As the relationship between velocity and energy costs becomes 
non-linear , the dolphin divorces the swimming mode for its novel and more 
economical " running 1 ^ mode. In shdrt, scale changes reveal -the non-linearities 
in -the- system and are capable cf effecting qualitative changes in behavior, A 
major enterprise then for a science of movement becomes one of identifying the 
nece*ssary conditions under which such bifurcations or phase transitions occur 
(for clues , see Yates, in press; Yates et al,, 1972 ) 
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GENERAL DIS CUSSION 



In -previous papers and in the introduction to « .this one, we have 
considered wa^ys in which the motor system might solve the degrees of freedom 
problem, first identified by Bernstein (1967). Our approach is, double-edged: 
first, we have sought to identify and analyze rigorously functional groupings 
of muscles — synergies or coordinative' structures — constrained -to act in a 
unitary way; and second, we have attempted \to establish the language of 
dynamics as the appropriate vocabulary upon which to rationalize the 
properties of coordinative structures. More specifically, we have provided 
theoretical grounds suggesting that a group of muscles' organi zed as A single^ 
functional' unit possesses behavioral properties qualitatively simifar . to a 
non-linear- oscillator, "'mat constrained collectives of muscles exhibit a 
likeness to' oscillatory mechanisms was. intuited many years ago by Bernstein* 
(1947) (see. Greene, 1978) and advanced on empirical grounds by Fel'dman's 
mechanographic analysis of arm movements (Asatryan & Fel'dman, 1965; Fel'dman, 
1966a, 1966b) , More recent data (see Section 4) have borne out Fel'dmarfte 
work, in detail, and led to an elaboration .of the potential advantages thats. 
such a view offers 'in terms of economy of neural control (cf-. Greene. 1978; 
KeLso & Holt, 1980.). 

The present research is continuous with the above cited work, which 
suggests that muscles acting at a joint— to a first approximation—exhibit 
behavior qualitatively similar to a mass-spring system. But it also 
recognizes the" universal fact that biological devices must necessarily 
incorporate non-linear features- if local thermodynamic losses are to be 
compensated for and behavior sustained. flhe mathematical description for 
persistent cyclical operation is known as the limit cycle, 'and the present 
experiments have shown that patterns of coordination between the limbs (and 
within a single limb) can be accurately predicted from limit cycle properties. 
To summarize briefly, we have shown a •clear-cut tendency for cycling limbs to 
maintain fixed • amplitude and frequency under a variety of experimental 
conditions (brief and constantly- applied load perturbations, the presence and 
absence of fixed mechanical constraints, different external driving 
frequencies;. In addition, the tight phasic and timing relationships observed 
'before and after imposed perturbations, as well as the demonstration that 
limbs cycling at> different frequencies Veveal non T arbitrar y subharmomc 
relationships, attests strongly to the entrainment property of non-linear, 
limit cycle oscillators. -^j 
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'--^ Though motivated by <qiute different concerns, the present experimental 
results fit rather well with Fel'dman's most recent work (cf. Fel'dman, 1980), 
in which subjects performed rhythmic movements of the right elbow joint dt 
different frequencies and under "Various loading conditions. Like us, FelMman 
found that after unloading, rhythmic movement was preserved without visible 
change of either phase or frequency (see Fel'dman, 1980, Figure 5). According 
to Fel'dman, this result indicates that the "central generator" is independent t 
of affenent influences created by the unloading. The latter conclusion is 
compatible with new data on phasic movements q_f the ankle joint (cf.. Gottlieb 
&^*"Agarwal, in press). When torques were applied at various pc>ints during 
movement initiation and execution,* both early, myotatic (40-100 msec) and 
later, post-myotat :c (100-300 msec) reflex rer^onses to load changes were 
suppressed. 'Such was the carse regardless of whether the phasic movements were 
of a slow, ramp nature or a fast, ballistic nature. Gottlieb and Agarwal (in. 
presS) suggest that there is substantial "preprogramming" of both types of 
movement > and that phasic movements in general are not assisted by effective 
lo^d-compensating reflex mechanisms. 

While the neurophysiological findings of Fel'dman and Gottlieb and 
Agarwal provide encouraging support for the present work, their focus is on 
the details of the neuromuscular response to externally imposed loads on a 
single joint. In contrast, the present experiments used load perturbations 
(among other manipulations) as a tool to discover patterns of coupling between 
thje limbs, predictable from the properties of non-linear oscii\ators (see also 
Yamanishi, Kawato, & Suzuki, 1979, 1980,. who use discrete vi: jal , verbal, or 
manual events to perturb cyclic finger tapping). Of the latter, we have 
argued that the property of. entrainment may be most significant for a viable 
theory of coordination .9 

As an example, entrainment (often under a different name, see Section 6), 
may well account- for the stable phase and timing relationships that 
characterize the various gaits of animal locomotion (for an excellent review, 
see Gallistel, 1980). It now appears that, interneurons — at least in the 
central nervous' systems of crayfish and cockroach— carry coupling or 
entraining signals that .enable one limb to ' coordinate with another 
. (cf. Delcomyn, 1980; Stein, 1976). The role of such "coordinating neurons" is 
currently being explored within a conceptual framework provided by the' 
mathematical theory of coupled oscillators (Stein, -1976, 1977). 

While we are sympathetic to the foregoing enterprise, we are also 
reminded of Davis 1 ( 1 976) warning that properties of command f oscillation f 
coordination, and so on are not invested in any specific neuron. Coordination 
and oscillation are functions that reflect the interaction of cells, and are 
most correctly thought of as emergent properties. Davis (1j)76) provides 
several examples in which a particular function arises' from a neuronal ^network 
even though no single neuron within the network possesses that function. 

The theme that functions like coordination are emergent, a p osteriori 
consequences of systematic interactions among cells (or muscles), as opposed 
to a priori prescriptions invested in a single cell (or program), is consonant 
with the dynamical perspective offered here (see' also Fentress, 1976, for a 
similar "relational dynamics" perspective). For the physical theory of living ^ 
systems (homeokinetics) , entrainment is the chief mode of cooperation among' 
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self-sustaining oscillators; it is an emergent, self-organizing process in the 
sense that a collection of mutually entrained oscillators functions as a 
single unit. Therein lies its appeal, of course, for a principled solution to 
the degrees of freedom problem. 

A further, and by now self-evident consequence of the homeokinetic view 
concerns the role of oscillation. It has long been recognized that cyclicity 
lies at the heart of biological functioning (cf. Goodwin, 1970, p. 8). Yet in 
the domain of movement, it has been commonplace (with certain notable 
exceptions) to consider fluctuating events as mere nuisances — as unwanted 
sources of variability. It has been easier to model control in terms of 
* quasilinear servosystems than to search for ways in which oscillation may be 
exploited . In closed-loop servomechanisms, oscillation is undesirable because 
it means that there is a discrepancy between the input and" the reference level 
or set point, and hence the system is unstable. However, a more important 
role for oscillatory processes is in' n'on-linear systems that do not possess 
reference levels but that attain stability by virtue of entrapment. 

• * The present studies and our preliminary work on the interaction of speech 
^ and gesture (Kelso, Tuller, & Harris, in press) are motivated by the latter 

* theme. Relatedly, the approach to understanding motor control pursued in 
these experiments follows precisely the line of research proposed recently by 
Delcomyn (1980) following an extensive review of the neural basis for rhyohmic 
behavior iQ lower phyla. Delcomyn (1980) identifies three problem areas, 

• answers to which "...will bring neuroscientists ^uch 'closer to the ultimate- 
goal of understanding how nervous systems function" (p. W). These are: 
«(i) the nature of an . oscillator , (ii) the interaction of oscillators, (iii) 
the way in which sensory inputs interact with oscillators and their output to 
shape the final motor output" (p. H97 ) . The arguments proposed herein suggest 
that principled solutions to the foregoing problems may lie in physical 
biology/ particularly in Homeokinetics and Dissipative Structure theory (see 
Kugler et al . , 1982; Prigogine, Note U; Yates, in press, for comparisons and 
contrasts) . 

In our concluding remarks, we wish to make explicit one further contrast 

* (that may already be apparent) between the present view and current concepts 
in the motor behavior area. As we noted at the beginning of this article, the 
conventional view attributes the regularities we observe in movement to an 
explicit, a priori prescription. But in an oscillator ensemble, there are no 
fixed dominance relationships in the sense that a program or reference level 
stands in a fixed, autocratic relation to the muscles responsible for 
implementation. There are different modes of interaction (e.g., frequency and 
amplitude modulation) and there may be "preferred" phase relationships under 
conditions of maximum coupling. A wide variety of behavioral patterns emerge 
from these interactions; there is structure and presumably a complex network 
of neuronal interconnections to support such cooperative phenomena, but 
strictly speaking, there is no dominance relation. If this view is correct, 
coordination is not prescribed by anything; it is more properly viewed as an 
emergent consequence of the dynamical behavior of a system whose design is 
fundamentally periodic . ' 
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FO OTNOTES 



Ull current theories of regulation and control—mathematical systems 
theory, automata theory, cybernetics — though different in detail are alike in 
likening man to a machine; it is to this issue that we address limitations. 

2Exanples of biological escapements can be found in numerous metabolic 
processes such as the Krebs cycle where release of ATP is triggered by the 
metabolic demands of the cell. 

3Strictly speaking, regulators and servomechanisms are different concepts 
even though it is common in" the motor control area to hear them used 
synonymously. In the former, the reference value remains constant; in the 
latter, the reference value varies continually. 

••Synchronization and entrainment are- often -used synonymously in the 
literature. Strictly speaking, synchronization is that state that occurs when 
both interval and phase of coupled oscillators are matched exactly; 
entrainment refers to the matching of intervals, but one oscillator may lead 
or lag the other. 

5interestingly, in American Sign Language (Klima & Bellugi, 1979) the 

"co ns tr ain ts — ofrserv«d in -the KeXso e t al . experiments a re omnipresent. 

According to Klima and Bellugi (1979): "If both hands move independently 
during a sign's articulation, then the two hand3 must exhibit identical hand 
configurations; the points of articulation are severely constrained with 
respect to one another (they must be in the same location or on the same 
horizontal or vertical plane); and the movements of the two hands must be the 
same (whether performed simultaneously or in alternation). The symmetry 
constraint thus specifies that in a two-handed sign, if both hands move and 
are active, they must perform roughly the same motor acts" (p. 61). 

6on very few occasions, due to malfunctions in analog- to-digital 
conversion, a subject's data could not be registered for future analysis. As 
the Tables indicate however, we still had a sizable nvmber of observations on 
which to base statistical comparisons. 

7Two things should be emphasized in this section. The first is thq* = • 
we offer here is a demonstration of a phenomenon that is coherent with the 
theoretical picture. A more formal analysis of sub-harmonic entrainment is 
being undertaken "nd some preliminary results have been presented (Kelso, Note 
5). Second, as pointed out by R. A. Schmidt (Note 6) the pattern classified 
as 3:2 in Figure 5 is not 3:2 in the musical sense. In fact the- "3" part is 
really a blank-move-move, which is the same as the "2" part. Again, this 
analysis attests to the difficulty subjects have in performing independent 
rhythms with the hands. 

8ln fact, more recent work (reported in Kelso,, Note 5) indicates that if 
the two limbs operating out-of-phase (flexion in one and extension in the 
other) are driven at a certain critical _ frequency, they will change phase 
abruptly (in less than a cycle period) to an "in-phase" pattern. Tne 180 
degree transition in phase brought about by a scalar increase in driving 
frequency is perhaps -the principal feature of one of the simplest non-linear 



systems modelled by the Duffing equation. Thus, through a continuous change 
in one variable (e.g., frequency), a discontinuous change in another (e.g., 
phase) can be observed. That is, an elementary cusp catastrophe or 
bifurcation ^occurs (Saunders, 1980; Thorn, 1975). Importantly, there is no a 
priori prescription for the changed mode of organization anywhere, 

9 Norbert Wiener (1965), in a lecture prepared posthumously, discusses 
the benefits of entrainment as a non-linear, self-organizing phenomenon in 
biology and engineering. There are indications in this paper that Wiener was 
seeking solutions to nervous system organization in the dynamics of non-linear 
oscillators — with ehtrainment as a major feature . Ihis view, " as we have 
argued here, is rather different from the closed-loop, cybernetic theory that 
Wiener popularized . 
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MOTOR CONTROL: WHICH THEMES DO WE ORCHESTRATE?* 
J* A* S, Kelso* and E, L, Saltzman 



When we are confronted with a living system, whose design is mysterious* 
and whose optimizations are obscure, it 'is no easy task — as Professor Stein 
reminds us — to arrive at an answer to- the question he has posed in this paper* 
Stein 1 s target article is an important contribution for two main reasons. The 
first, which we shall mention only in passing, is that it is likely to provide 
much debate on what the controlled variables might be; moreover, it will force 
those who find this a burning issue to put their cards on the table* The 
second, and we feel more important reason, is that tne paper poses a 
question— "What muscle variable(s) does the nervous syste'O control* • •?"— whose 
very" nature raises questions about the strategies nsuroscience uses to 
investigate problems of control and coordination of movement* In our commen- 
tary we will focus on some of the (not so) implicit assumptions behind the 
question posed by Stein; if nothing else we hope to heighten sensitivity to 
some of the issues involved and (perhaps) to force the neur oscie ntist to 
consider his/her epistemology. „~ 

V 

There are a couple of questionaole assumptions in Stein's approach — at 
least as reflected in this article. - The first is that control is the province 
of the nervous system; the second is that it is muscle variable(s) that are 
controlled. We shall examine each assumption in turn and their consequent 
ramifications for elucidating principles of coordination and control. In 
addition,- we shall point to one notable omission in the author's list of 
candidates for control, and in our final remarks take up Stein's invitation to 
advance, albeit briefly, an alternative position, to the control theoretic 
stance that he advocates heYe. Although it may, for the author, seem 
"...natural to assess performance. .. in similar, terms to those applied to 
motors or other devices which produce movement, '■! we assert that there are 
certain fundamental differences between living systems and machines (apart 
from structure) that render such a strategy not only dubious but highly' 
unnatural. Most so-called "machine theories" regard biological control as a 
technical or engineering problem in which the many degrees of freedom to tje 
regulated are a "curse" (Bellman, 1961). In contrast, there are contemporary 
'physical theories yet to be explored - fully in the domain of moyiijnent that 



*An invited commentary on R. B. Stein "What muscle variable(s) does the 
nervous system control?" The Behavioral and Brain Sciences , in press. 
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consider many degrees of freedom and non-linearities to be a requisite (not a 
source of "complication") for the emergence of coherent phenomena. The 
physical strategies being uncovered (principally ir\ the form of Iberall "and 
colleagues 1 Homeokinetic Theory, Prigogine and colleagues 1 Dissipative Struc- 
ture Theory, Haken f s Synergetics and Morowitz f s Bioenergetics; see the final 
part of this commentary for references, and Kugler, Kelso, and Turvey, 1982, 
for contrasts amor.g these theories and their application to control' and 
coordination) stress autonomy, self-organization and evolution of function as 
system attributes — attributes^ that already appeal to some neuroscientists 
(cf. Katchalsky, Rowland, & Blumenthal, 1974; Llinas & Iberall, • 1977; Szenta- 
gothai, 1978). Though we can classify ourselves as, at best, informed 
amateurs in this area, we believe these system attributes will prove difficult 
for the student of movement to ignore. A consideration cf the assumptions 
behind the question posed by Stein may allow us to ground this claim more 
firmly. 

Control as the Province of. the Nervous System 

It Was surely one of Bernstein's (1967) most significant contribution? 
(and he made many that have still to be appreciatea) that control and 
coordination are not reducible to the orchestration of neural signals to and 
from the motor apparatus. Stein appears to recognize this fact m several 
places (e.g., in his discussions of stiffness, *and his awareness of the 
possibility that energy fluxes may shape control), but the paper as a whole 
shows little appreciation of it. In fact, the predominant methodology in the 
studies cited by the author dictates that the organism and its parts are 
quiescent until mechanically or electrically stimulated. Obviously, we do not 
wish to be interpreted as saying that such a methodology has not proved useful 
in many cases or that the effects observed are not real. But control involves 
more than reactivity , and its analysis goes far beyond the deterministic 
input-output approach espoused in Stein's pnper. One wonders to what extent 
the results pf the studies- cited by. Stein, many of which invblve single 
muscles, in non-intact preparations, can be generalized to normal movements in 
prganisms continually interacting with their environments. There are a number 
of grounds for expressing skepticism on this issue (cf. Bernstein, 1967). To 
be blunt, an unequivocal . relation between neural impulses to muscles and 
resulting movement does not, and cannot exist (see Benati, Gaglio, Morasso, 
Tagliasco, & Zaccaria, 1980; Boylls, 1975; Kelso & Holt, 1980; Saltzman, 1 97-9 : 
Turvey, Shaw, & Mace, 1978, for anatomical, mechanical, and physiological 
sources of functional non-univocality or indeterminacy). 

Consider, for example, the task of maintaining the elbow at a steady 
state angle of 180 degrees (i.e., the elbow in full extension). The 
orientation of the ami in the gravity field determines not only the relative 
contributions of muscle and gravity torques at the 'desired elbow angle, but 
also the stability properties of this equilibrium configuration. When the arm , 
is in a downward, vertical orientation, the elbow angle is stable; if the elbow/ 
is perturbed (flexed), it will return to equilibrium due to the stable/ 
restoring torque of gravity. No muscle activity is required in this cas^C 
If, however, the arm is in an upward vertical orientation, the elbow angle 'is 
unstable since gravity plays a destabilizing role in this configuration. If 
the equilibrium angle is to be restored, muscle activity is required to 
provide the stabilizing restorative torque. Thus, the relative contributions 
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of gravity and muscle stiffnesses for a stable equilibrium angle vary with the 
arm's orientation in the gravity ■ field. In short, the nervous system 
"controls" only to the extent that it complements the force field of the 
environment. Its r ole is better envisaged as* exploitative rather than 
injunctive. 

On the Selection of Analytic Units 

In the target article, Stein has attempted to pinpoint the variables . used * 
by the nervous system to^ control muscular activity ■ during the performance of 
sensorimotor tasks. Such tasks might involve a limb that moved, remained 
stationary, or exerted' forces and torques at the limb-environment^ interface. 
It is important to recognize, however, that such controlled "variables are 
defined only in the context, of the organizational structure(s) comprising the 
system-to-be-controlled. These organizational structures -are defined func- 
tionally at a higher level of .description than that of single^ mascles or 
joints. Specifically, they are defined at the relatively abstract task level . 
and, as mentioned in the section above, span the dynamic system composed of 
both organism and environment. In solving the problem posec^ by a task, the 
nervous, system is the indispensable medium through which the 'requisite limb 
organization can emerge. The limbs (or any set of articulators) thus become 
different types of functionally defined, special purpose devices for different 
types of tasks. 

Although the immediately preceding statements may seem trivial at first 
glance, they reveal a perspective that has decidedly non-trivial implications 
for how we approach the* problem of controlled variables. More specifically, 
this perspective leads us to place significant constraints on our selection of 
analytic units of behavior. Professor Stein, for example, limits his analysis 
to "...simple physical variables* appropriate to single muscle's or groups of 
muscles acting normally%round a joint." Few* scientists would disagree that 
some decomposition of the system is, necessary for analytic purposes. However, 
the unit of analysis-should not be casually or arbitrarily chosen, at least if 
the ultimate goal is to understand control in animals (not simply in a single 
joint). Our point'can be made through* an example from physics (cf* Rosen, 

It is well known that the three^body problem defies an analytic solution 
in closed form: Whether the earth-sun-moon system is truly stable is an open 
question. * Although it is possible to decompose the system into one-body and 
two-body' subsystems that are completely tractable analytically, such a strate- 
gy does not faeilitate obtaining a solution to the three-body problem. The 
reason is that' the physical decomposition itself destroys the original 
dynamics. In order to solve the three-body problem, a new set of analytic 
units mtrst be discovered that are defined by new observables, such that the 
partitioning of the system does ' not annihilate the - original dynamics. As 
Rosen (1976) remarks, this partitioning will seem strange to us because we are 
used JfcS^selecting so-called "simple" units that correspond .to some physical 
fractionation of U\e system. The point is- that when we reduce or decompose 
the system, the greatest care must be taken in selecting the 'proper unit -of . 
- analysis. It is most likely, that "simplicity" • (a term with an expeedingly 
slippery definite n) will be neither the ojily, nor the chief criterion 
involved. i 



Returning to the domain of movetoent, the identification of appropriate 
units of analysis has long been a thorny issue, going back, at least' to 
Sherrington's reference to the reflex as "♦♦♦a simple, if not a probable 
fiction^ 1 (Sherrington, 1906). More recently, Greene (1971) in echoing Nicolai 
Bernsteio has remarked that much of our confusion in studying problems of 
coordination has arisen "from our limited ability *to°recognize the significant 
informational units of movement. V However, "there 6 are signs (although only 
considered in passing in the target article) that some consensus may be 
drawing near. There has been a growing appreciation that individual muscles 
(or muscle variables) are not the proper units of analysis for discussing 
, cpordihated movements; rather, such movements are partitioned more naturally 
'into collective functional units defined over groups of muscles and joints, 
within which component elements vary relatedly and autonomously (e.g., Boylls, 
1975; Fowler, 1977;. Kelso, Southard, Goodman, 1979; Lestienne, 1979; 
Nashner, 1977; Saltzman, 1979; Szentagothai & Arbib, 1974; Talbott, 1979; 
Turvey, 1977). The Soviet school (e.g., Bernstein, 1967; Gelfand & Tsetlin, 
1971 ; Shik & Orlovskii , 1976) refers to such muscle- joint groupings as 
linkages or synergies . These terms reflect an appreciation of the fundamental 
problem of control and coordination, namely that of constraining a complex 
system with many degrees of ^freedom to behave in a regular and orderly manner. 

Synergies (or coordinative structures, cf. Easton, 1972; Kelso et al., 
1979;-Turvey et al., 1978) by definition are functionally specific units 
defined over groups of muscles and joints, which constrain the component 
elements to act together in a manner appropriate to the task at hand. Such 
muscle collectives are thought to share ,a common efferent and afferent 
-organization and are deployable as relatively autonompus units in sensorimotor 
tasks (e.g., Boylls, 1975; GelCand, Gurfinkel, Tsetlin, & Shik, 1971 ). 
Coordinative structures as functional units of control are currently undergo- 
ing rigorous analysis in a number of laboratories; they have been identified 
in various tasks and at different levels of analysis (cf. Kelso, 1981; Kelso, 
Tuller, k Harris, in press; Kugler, Kelso, & Turvey, 198J, for recent 
examples). Their chief feature rests in a mutable partitioning of component 
variables into those that preserve the structural ("topological") organization 
of movement (e.g\ , the relative timing and relative force properties of 
muscular events) and those that are capable of effecting scalar transforma- 
tions on these qualitative structures. A theoretical rationale for coordina- 
tive structures has been offered (cf. Kelso, 1981? Kugler et al., 1980, 1982), 
focusing on those properties that distinguish movement patterns that exhibit 
structural stability from those that' do not. 

There are indications in the target article that Professor Stein prefers 
to sidestep the issue of functionally specific units of movement as not 
germane to his interests, ana as one that pertains only to "multi joint 
movements 11 or the "large behavioral literature orr complex patterned 
movement." However, he does not hesitate to negate arguments for length and 
stiffness control on the basis of "complex patterned movements" like speech or 
.piano playing in the case of 'length, and walking in the case of stiffness. We 
welcome the functional argument in^each case, although we note that for Stein 
it involves jumping rather precariously between muscle- joint levels (e.g., 
stiffness and length) and task levels of analysis (such as skiing and ueedle- 
threading). Though aware of the 'problem, Stein seems .to apply a single 
muscle- joint unit of analysis generally, to all types of complex multijoint 
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tasks. Such an approach is at the same time too powerful and too arbitrary. 
It is too powerful because it allows descriptions of movement control that 
fail to distinguish between those acts that do occur- and those acts that are 
physically possible but never occur. It is too arbitrary because single joint 
actions will rarely relate unequivocally to particular task functions. • In 
shortt, when we deal with coordinated activity, we are doling with task 
specific functional units whose degrees of freedom are constrained according 
to task demands, or more generally, to the mutual relationship between 
organism and environment. 

Contrasting Vieys on the Origins of Order 

Whenever we ^obsfer^ve a regular and orderly phenomenon, it is' always a 
temptation to assign responsibility to some device that is antecedent to, and 
causally responsible for, tfte^said phenomenon. " The device has available to it 
"representations"' that have cfiar^cteristics very much like the phenomenon we. 
are trying to 'understand. As philosophers have often told us, "representa- 
tions" require users with goals ana^iqterests (much like the animal itself) 
and so, when we&> assume their presence, we>t^ke out a loan on intelligence that 
must ultimately be- paid back .(cf. Dennett" 19^8; Searle, 1980). *We can bury 
our heads in the sand on this issue or we caft .approach the problem in a 
different way: one that asks not how control can^ be explained according to 
s&me a priori prescription for the system (such-as the central representations 
and the cybernetic, negative feedback- paradigms favored by Stein), but rather 
how control arises as an a posteriori . consequence of the system's dynamical 
organization. % 

For example, imagine adopting the former, prescriptive strategy to a 
coherent biological phenomenon such as the schooling of fish. What we observe 
af*e individual fish behaving" collectively in a highly coordinated manner. The 
"system" in this case has many degrees of freedom and exhibits an organized, 
seemingly wholistio structure. Adopting a prescriptive strategy, we might* 
search the system for a "reference value" or a "central representation 11 that 
regulates the individual fish or the collective of fish, but it would make 
little sense to do so. These would be special mechanisms introduced by the 
unknowing observer to account for a poorly understood phenomenon. In fact, 
tfie highly coherent behavior of fish schooling can be accounted for with a 
fairly small set of key variables, such as "density" defined through the 
metric of fish iength. When the average distance between nearest neighbors is 
less than bne fish length (note that the metric is "intrinsic" and system- 
scaled; cf. Warren & Shaw, 1981), spacing between fish is schooled not random 
(cf. Okubo, 1980, for an in-depth analysis)* 

.^Although the details of collective 'fish behavior may seem far removed 
fronf^the issues -aised by Stein, there is, we think, an important message for 
the neuroscientist or psychologist. It is that an understanding of a complex, 
organizational phenomenon such as fish schooling rests^ with articulating the 
necessary and sufficient conditions for that organization to occur. More 
generally this approach entails a strategy that rejects the introduction of 
special mechanisms — as sources of explanation — before dynamics has been fully 
explored. Put another way, what can we, as students of movement, explain "for 
free" before we burden the nervous system with the onus of control? 
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/In this regard, it is puzzling to us that Stein chooses to ignore a model' 
whose dynamics obviate (or at least significantly reduce) the requirement for 
ongoing, computational control* If recent work is a guide, much may be gained 
through the identif icatiorr of functional units.; of movement with nonlinear 
mass-spring systems. Although the model has received an uneven interpreta- 
tion, its ^-import for us *is that it allows one to. see the qualitative 
similarities between certain aspects of movement control (such as the ability 
to reach the same desired ?patfal location, with different trajectories and 
from variable initial conditions) and the behavffcr of a mass-spring system. 
Following our arguments expressed above, the beauty of the mass-spring model 
lies not in the literal parallel between a single muscle and a spring, but in 
the recognition that particular behaviors shane r — to a first approximation — the 
same- abstract functional organization as a mass-spring rsystem. 

The intuition that a muscle- joint system is dynamically similar to a 
mass-spring system with controllable equilibrium length is due to Fel ! dman 
(cf. Fel ! dman, 1966, p. 771). and has undergone appropriate extension by a 
number of .authors (e.g., Bizzi, Polit, & Morasso, 1976; Kelso, 1977; Kelso & 
Holt, 1980; Polit & ^izzi, 1978; Schmidt i.McGown, 1980); The basic idea -is' 
that a given joint angle may be' specified according to a set of muscle 
equilibrium lengths. Once these, are specified, the joint wiM achieve and 
maintain a desired final angle at which the torques generat^i by the muscles 
sum to zero. Such a system "exhibits the property of equifinality in that 
desired positions may be reached from various initial angles, and 'in spite of 
unforeseen perturbations encountered during- the motion trajectory (see Kelso, 
Holt, Kugler, & Turvey, 1980 for review; but also Saltzman, 1979. for some 
cautionary notes). Fel'dman (196*6, 1980) has further noted that stiffness at 
a- joint may be specified in , terms of agonist and antagonist equilibrium 
lengths even if the stiffness of these muscles is not itself controllable. In 
the Fel'dman model, joint stiffness covaries with the degree of agonist- 
antagonist co-contraction. 

<? , 
Two points for Stein emerge from this discussion. One concerns a sin of 

omission in that he includes the spring . property of stifiness as a possible 
control variable, but neglects the related variable of equilibrium length. 
The other, perhaps more important issue warrants a little further development^ 
because of its theoretical consequences. " It is that in likening (to a first 
approximation) a constrained collective of muscles to a mass-spring system, 
the need to introduce externally imposed-, measurement , comparison, and control 
operations is reduced. Though we could describe a dynamical system lil^e a 
mass-spring in terms of externally imposed reference J.evels and though we 
could mathematize it into canonical feedback form, little would be gained by 
doing so (cf. Yates, in press). A muscle collective qua spring system is 
intrinsically self-equilibrating: conserved values such as the equilibrium 
point emerge from* the system's parameterization. More emphatically, in mass- 
spring systems (like schools of fish and functional groupings of muscles?) 
there iz no need to introduce a "representation" anywhere*. 

Toward an Alternative Control Scneme 

In our final*' comments we take up — in rather condensed fashion because of 
space limitations — Professor Stein's invitation to his critics to offer an 
alternative scheme to the one that he has put forward so authoritatively. We 
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refer to an emerging theoretical view of movement control and coordination 
that has been expressed in' two recent papers (Kugler et al. t 1980 f 1982) and 
that has also undergone some f as yet limited, empirical scrutiny (Kelso f Holt f 
Kugler f & Turvey f 1980; Kelso f Holt f Rubin, & Kugler f in press). Its origins 
(and to a large extent its appeal) lie in a unified treatment of cooperative 
.phenomena at all scales of magpj^ude (cf. Haken f 1977). Hence it speaks to 
the^ important issue raised by ^Stein at the beginning of his article, namely 
that of relating levels of analysis. Moreover, the perspective is consonant 
.with- some -of- the themes introduced above and also may interface^with evolving 
oscillator theoretic views of neural control not considered by Stein in the 
target article (e.g., Delcomyn f 1980), 

A chief distinguishing feature of the view expressed here lies in the 
recognition that first and foremost, living systems belong to a class of 
physical systems that are .open to fluxes of energy and matter (in -contrast, 
cybernetic systems are. closed to energy and matter exchange with their 
surrounds). The. principal theories addressing such systems are Iberall 1 s 
Hbmeokinetic Theory (e.g., Iberall, 1977, 1978; Soodak & Iberall, 1978; Yates 
. & Iberall, 1973) and Prigogine's Dissipative Structure Theory (e.g., Nicolis & 
Prigogine, 1977; P^igogine, 1980). The former, - in particular, addresses 
systemic phenomena in biology and elaborates, among other things, the condi- * 
t ions - for persistence of function, autonomy, and self-organization. It 
represents a concerted effort to apply irreversible thermodynamics to living 
. -systems. A fundamental tenet is that in steady-state systems the flow cf 
energy through the system plays an organizing role and that, following 
Morowitz's theorem, .energy flow from a'potential source to a lower order sink 
will lead to at least one cycl£ in the system (cf. Morowitz, 1968).' 
Homeokinetic theory builds on the Barnard-Cannon principle of homeostasis 
(which contained -no mechanism for preservation of conserved states); in the' 
homeokinetic vie#, control is dynamically effected , by means of coupled 

• ensembles of limit cycle oscillatory processes. Limit cycles represent the 
"only temporal stability for non-conservative, nonlinear systems; they resemble 

"squirt" systems that, by virtue of their design, are capable of making up for 
dissipative losses that occur in the drift toward equilibrium (see Yates & 
Iberall, 1973). The system's conserved values or equilibrium operating points 
* are thought to be specified in the loose coupling of limit cycle processes. 
Limit cycles are manifestations of . thermodynamic engines and quantize action ^ 
(formally, the product of energy and time; cf. Iberall, 1978) at every level 
in the system. 

As functional units of movement, ensembles of nonlinear limit cycle 
oscillators offer a nugj>er of attractive features for a principled account of 
coordination and .control, r- Among these are their self-sustaining properties, 

• th^ir ability to operate independently of initial conditions, their stability 
in the* face of moderate perturbations, "and, perhaps most important for the 
theorist of movement, -the properties of mutual entrainment and synchronization 
(cf. Minorsky, 1962; Winfree, 1980). ; 

With respest to the issues raised by Stein\ it iS worth emphasizing that 
- limit cycles are- not special mechanisms £er ' se . To " observe spectrally 
distributed limit cycle regimes and v for new spatiotemppral organizations to 
emerge,- certain necessary conditions must exist. Among these are the presence 
of many interacting degrees of freedom, nonlinearities, a - relatively constant 
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source of potential energy and the requirement that energy be dissipated. 
Given such conditions and subject to critical scaling influences, constraints 
emerge that are capable of marshalling the free variables into coherent 
functions. Quadruped gait may be an example: When one stable movement 
pattern is driven beyond a critical value on a system-sensitive parameter, a 
bifurcation occurs and a new spatiotemporal pattern — a new stability — arises. 
In such a view, no explicit "gait selection" process is required (e.g., 
Gallistel, 1980). * " * 

To reiterate our main point, however, in the perspective offered here, 
order (control and coordination)' is functionally specified in the system's 
dynamics. The radical claim, as Gibson (1979) once remarked, is that behavior 
is regular but there c.re no regulators. A less radical statement, would be an 
affirmative answer to Yates 1 (1980) question to the readers of the American 
Journal of Physiology : Do [you] know of a serious effort to discharge the 
homunculus? 

The spirit of the foregoing discussion leads us to raise one final issue. 
It is the growing intuition — stemming from theoretical considerations raised 
he-e and elsewhere (cf. Anderson, 1972)— that the problem of order in natural 
systems might be attacked more effectively by seeking out a single set of 
physical principles that can apply at all levels, rather than by positing 
different units of analysis c at each level. One assumes nature operates with 
ancient themes. In this commentary we have tried to provide a flavor for the 
ones that neuroscience irv general, and the field^ of motor control in 
particular, might consider worth orchestrating. 
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EXPLORING THE NVTURE OF MOTOR CONTROL IN DOWN'S SYNDROME* 
Walter E. Davis- and J. A. Scott Keiso++ 

Abstract * Following Asatryan and Fel'dman (1 965), two -experiments 
were conducted to describe the so-called invariant mechanical 
properties underlying movement control .in Down's syndrome and normal 
subjects. The invariant characteristic is a curve on a graph of 
joint torque versus joint angle. The task required subjects to 
maintain a steady joint angle against an external load ( torque) . 
Torque was systematically changed via partial unloading in order to 
obtain torque by length (joint angle) functions at three separate 
initial joint angles. Instructions required subjects "not to 
intervene" when unloading occurred in Experiment 1 and to "tense" 
their muscles prior to unloading in Experiment 2. Both normal and 
Down's syndrome groups revealed systematic torque by length 
functions that might be expected according to a simple mass-spring 
system model. Although the gross organization of movement in Down's 
syndrome subjects was nearly the same as normals, important 
differences between the two groups were found. Down's syndrome 
subjects revealed underdamped motions relative to ncrraals.(as shown 
by differences in the degree of 5 oscillation atout the final 
equilibrium position) and were less able to regulate stiffness (as 
shown by" differences in slope of the to.rque by angle functions in 
Experiment 2).' We promote the notion that damping and stiffness may 
be sensitive indices of hypotonia- -the most common description of 
neuromuscular deficiency in Down's syndrome. 

There is very little research on the control and coordination of movement 
, in Down's syndrome subjects. The few studies that do exist reveal that this 
* population exhibits abnormal gait (James, 1974), slow movement responses 
(Berkson, 1960; Lange, ,1970) and is less accurate on certain motor tasks 
•(Frith & Fritfi, 1974) than their normal counterparts. However, questions 
concerning the underlying organization of motor control in Down's syndrome 
subjects remain unanswered. 
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The most often implicated motor deficiency in Down's syndrome subjects is 
hypotonia, a decrease or absence of muscle tone* Indeed, nearly all Down's 
syndrome infants are born with hypotonia, a condition that may contribute 
greatly to their delay in reaching motor milestones (cf. Cowie, 1970). 
However, no definitive statement can be made about hypotonia in Down's 
syndrome individuals beyond infancy (cf. Knight, Atkinson, & Hyman, 1967; 
Owens, Dawson, & Losin, 1971). To complicate things, the exact relationship 
between hypotonia and motor control is, as yet, unclear (cf. Neilson & Lance, 
1978). 

One model — for which there is accumul v ating 'empirical evidence— and which 
may allow us to understand movement organization in Down's syndrome, draws the 
analogy between the uehavior of muscle collectives and a vibratory system 
(cf . Turvey, 1977). For example, Asatryan and Fel' dman (1 965), in a now 
classic experiment had subjects establish steady-state joint angles against an 
external load. On each occasion, when the load was systematically reduced and 
the subjects instructed not to intervene, the joint angle changed. Resulting 
torque by length (joint angle) functions generated from different initial 
positions were parallel and non- intersecting as would be expected from a mass- 
spring model. 

Additional evidence supporting a vibratory system model comes from 
research demonstrating equifinality in the muscle-joint systems of human 
subjects (e.g., Bizzi, 1980; Fel'dman, 1966b; Kelso, 1977; Kelso & Holt, 1980: 
'Schmidt & McGowan, 1980, for reviews) and trained monkeys (Bizzi, Dev, 
Morasso, & Polit, 1978; Bizzi, Polit, & Morasso, 1976). Equifinality, as 
defined, refers to a system's ability to equilibrate based solely on its 
parameters and independent of initial conditions (von Bertalanffy, 1 973 ) * 1 

Following Asatryan and Fel 1 dman' s (1965) analysis, the present study 
sought to describe the static mechanical properties of the muscle joint system 
of Down's syndrome subjects using a simple mass-spring equation: F = -K (l- 
1 0 ) , where F is an external force, -K is stiffness, 1 is the current length of 
the spring and 1 0 is the length of the spring when no forces are acting on it. 
Experiment 1 was designed to determine the so-called invariant characteristics 
(cf. Fel'dman, 1980a) of the muscle- joint system of Down's syndrome subjects. 
This was achieved by examining the relationship between change in joint angle 
under conditions of partial unloading when subjects were instructed not to 
intervene voluntarily against the unloading, thereby holding the parameters of 
stiffness and zero length relatively constant. Under these conditions, any 
change in joint angle should be systematic with the change in the load 
(torque). 

A second experiment examined the extent to which Down's syndrome subjects 
were capable of regulating stiffness by requiring subjects to ^ense their 
muscles voluntarily* at particular joint angles prior to unloading. Asatryan 
and Fel'dman (1965) demonstrated that normal subjects could voluntarily 
increase the stiffness of the muscle- joint system as reflected by increases in 
the slope of the torque by length function. It is not known whether stiffness 
can be regulated voluntarily by Down's syndrome subjects. Qualitatively 
speaking, the Down's syndrome population is characterized by hypotonia 
(flaccidity) , which may relate to muscle stiffness or to damping. Damping is 
the internal fractional force present in the system and is indicated by the 
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extent of overshoot or oscillation about the equilibrium point. As this study 
shows, both stiffness and damping appear to be sensitive indices of the 
impaired performance of Down's syndrome subjects. 

EXPERIMENT 1 
Method 

Subjects 

The subjects in this study were seven Down's syndrome male students 
between 14 and 21 years of age who attended Celentano School for the 
developmentally handicapped in New Haven, Connecticut. These subjects were 
selected from among 30 subjects who participated in a previous study that 
employed limb localization movements (Davis & Kelso, Note D. The subjects 
were selected on the basis of their ability to complete the task. The 
Celentano School administration reported a'n IQ range of 25-60 for the subjects 
involved in this study. A control group consisted of six normal male subjects 
selected from the subject pool listed at Haskins Laboratories in New Haven who 
were paid for their services. 

Apparatus 

The apparatus consisted of a 593c finger positioning device along with an 
associated electronics control package (see Kelso & Holt, 1980, for a 
description). The main parts of the apparatus were two. movable arms each 
attached to a separate metal shaft mounted vertically on the top of an open 
box frame. Only the right hand was used in these experiments. The frame was 
mounted on a table 78.5 cm hi^h. 

The movements allowed by the positioning device were flexion and 
extension of the index finger about the metacarpophalangeal joint. The distal 
end of the moving finger was fitted with a plastic collar that slipped into an 
open-ended cylindrical support attached to the movable arm. The movable arm 
consisted of two parallel bars fitted perpendicularly into the metal shaft. A 
pointer was attached to the end of the movable arm and moved along a 
protractor calibrated in degrees. The apparatus was also equipped with padded 
adjustable braces with which ~to secure the subjects' wrist, hand, and 
remaining fingers and thumb during the movements. 

The gear arrangement driven by a torque motor provided resistance (load) 
to the movable arm (and hence to the finger when placed in the cylinder 
attached to the arm) when current was supplied to the motor. The electronics 
control Box allowed for regulation of the current supply ^ d could *» in 
either of two modes. While in the first mode, which shall be referred to as 
"servo-torque control," a resistance (torque) acted on the finger whenever the 
finger deviated from the set servo-position. When the finger was i in i the 
proposition, no resistance acted upon it. When in the second mode 
referred to as "torque control." a constant resistance could be applied to the 
finger (settable in either direction). The amount of resistance (torque) was 
adXable and could be set anywhere from 0 to 100* of the maximum torque 
available from the motor (81.6 ounce-inches). 
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Procedure 



Procedures- for. the normal and Down's syndrome subjects were identical. 
The normal subjects were tested at Haskins Laboratories and the Down 1 3 
syndrome subjects were tested at Celentano School over a four-day period. 
Each subject was scheduled for "a 15-minute session each day. As a consequence 
of the difference in venue, Down f 3 syndrome subjects' movements were analyzed 
via visicorder tracings (Honeywell, 1508C) while normal subjects' movements 
were fed directly into a PDP 11/45 for later computer analysis. For both 
types of recordings, ,a set number of degrees of movement corresponded to a 
calibrated voltage. 

Each subject sat comfortably facing the apparatus with his right finger 
placed securely into the cylinder attached to the arm. Upon a signal from the 
experimenter, the subject moved to a specified steady-state joint angle 
(target) acting against _a load supplied by the torque motor. The target 
angles for Experiment 1 were as follows: S1 = 155 deg, S2 = 165 deg, and S3 = 
175 deg. When the subject's finger pointed straight ahead, the position was 
equal to 180 deg. The subject reached the target angle with a flexion 
movement from a starting position of 192 deg/ The target angle was set by the 
subject by matching a line on the oscilloscope. When the angle was set, the 
subject closed his eyes and the oscilloscope was turned away. Within 0.5 to 
2.0 sec after the target angle was achieved, a proportion of the load was 
released. The subject was instructed to maintain a steady resistance against 
cne load — as indexed by the stable achievement of initial angle — and not to 
interfere voluntarily with the movement of the finger if the load was 
released. 

For this study, partial unloading was achieved in the following manner. 
The control was set in servo-torque mode at the specified servo-position 
corresponding to the starting position as noted above. After the subject 
moved his finger to a specified joint angle, counteracting the resistance 
(100% torque) exactly, part of the load was released by switching from servo- 
torque control to torque control. This was achieved by the experimenter 
moving the manual switch on the control box. The amount of load released was 
regulated by setting the percentage of torque load. In order to obtain 
negative load release, the torque load was set at 5 and 10 percent but the 
direction of the resistance was reversed to act in the direction of flexion 9 
(i.e., in the direction that the finger was moving). 

The measurements taken were changes in joint angle and changes in the 
amount of resistance acting on the finger. Measurements of joint angle change 
were taken by hand from visicorder tracings of finger displacement and 
* measured to the nearest .5 cm, or obtained directly following A-to-D 
conversion (200 Hz) on the PDP 11/45. The criterion for determining the new 
steady-state joint angle was the point at which movement ceased (as shown by 
the' movement tracings) for at least 500 msec (.5 cm on the tracing) following 
unloving. 

A series of partial unloadings was conducted with each subject at each of 
three initial joint angles, S1, S2, and S3. For each series, seven separate 
unloading* were conducted (60J, 40J, 25*. 10» t 0J, -5*, and -10*), 
representing <a percentage of maximum torque. The seven separate unloadings 
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provided a sufficient amount of data to describe the torque by joint angle 
functions. 

For each of the unloadings at least four trials were carried out. In 
ell, more than 184 trials were conducted with each subject. The order in 
which the ti als were given within each series, 'or the different unloadings, 
was randomly assigned. -Series 1, 2, and 3 were presented to each subject in a 
predetermined balanced order. 

Design 

The data obtained from both Down's syndrome and normal subjects were 
graphed to determine if the torque by joint angle functions were parallel and 
non- intersecting a3 might be expected from the static equation, F = -K (l-l 0 ) # 
The algebraic error data were analyzed using a 2 (groups) x 3 (series) x 7 
(unloadings) analysis of variance with repeated measures on the last two 
factors. A similar o analysis was performed on variable error. In addition, 
tests of linearity (Wine> 1964) were also performed on the curves. 

Results 

Three sets of torque by joint angle functions were obtained as a result 
of partial unloading* The joint angle change in this case represents a 
deviation from the initial angle and conforms to standard measures of mean 
algebraic (constant) error. The results for the normal and Down's syndrome 
subjects are shown in Figure 1. The curves obtained for both normal and 
Down's syndrome subjects are indeed parallel and non- intersecting. It is 
clear from the figure that the two groups are nearly identical, and this was 
borne out by statistical analysis, F 0,11) = .20, p > .05. As expected, 
there was a significant torque change effect, F (6,132) = 173.8, j> < , as 

well as a significant series effect, F (2,132) = 6.23, < .01. There was 
also a significant series by torque interaction, F (12,132) = 2.65, J> < * 01 • 
Analysis of this interaction revealed that for the three greatest unloadings 
(0%, -5%, and -10#), the joint angle change was larger between series 3 and 2 
as well as series 3 and 1* Series 2 also revealed larger joint angle changes 
than series i at these unloading values. 

Analysis of variable error, like constant error, reflected a high degree 
of similarity between Down's syndrome' and normal groups. Only the torque 
effect was significant, F (6,132) = 25.67, < - 001 • For the seven different 
unloadings (from 60% to -10#) variability in angle change appeared to increase 
systematically (1.0, 1.7,, 2. 4, 3.2, 3.8, 4.4, and 4*1 deg) . No other main 
effect or interaction was significant for variable error. 

A trend analysis of the F by 1 function obtained from each of the three 
series of unloadings for Down's syndrome and normal subjects was performed. 
The results revealed that the functions for both groups were essentially 
linear. For example, for the Down's syndrome group the proportion of variance 
accounted for by linearity in series 1, 2, and 3 was 94%, 91%, and 92%, 
respectively. 
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FIGURE 1. ANGLE CHANGE AS A FUNCTION 
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Examination of the individual movement tracings, however, revealed some 
interesting differences. Representative movement tracings^ of individual 
subjects are presented in Figure 2A (normal subject Sn£ and 2B (Down's 
syndrome subject BV). The three movement tracings for e^ch subject were 
recorded during unload ings of 60% f \0%, and -10$ in Experiment 1. 
' Qualitatively speaking, movements of'norraal subjects from the initial position 
to the target angle we're, on the whole, more direct than the Down's syndrome 
subjects whose movements were more step-like or discontinuous (e.g., Brooks, 
1974) and less stable. Although movement speed was not specified to the 
subjects beforehand, a rather interesting finding is that Down's syndrome 
subjects took significantly longer to reach the target angle. A random sample 
of 136 trials from Down's syndrome subjects was compared to the same number of 
trials for normal subjects. Th$ mean movement time for the Down's syndrome 
group was 2.0 sec and the standard deviation was .4 sec compared to the mean 
and* standard deviation of .8 sec and .1 sec for v normal subjects. These 
movement times were significantly d if feren^from each other (p < .001). 

Down's syndrome subjects also differed from their normal peers in terms 
of oscillatory behavior at the newly established equilibrium position (i.e., 
following unl&ading, see Figure 2). Overshoot was measured to the nearfest .5 

, degree for each of the trials and was analyzed using a group by series by 
torque analysis of variance with repeated measures on the last two factors. 
As shown in Table 1 a significantly greater amount of overshoot was found for 
Down's syndrome than for normal subjects, £ (1,11 ) - 21.38, j> < .001. There 
was also, a significant series main effect, ]? (2,132) = 6.32, j> < ^ and 
torque main effect, £ (6,132) = ?8.67, j> < * 001 • The * mean overshoot f° r each 
series was 4.87, 5*48, and 6. 91 deg, respectively. As the amount of unloading 
increased (and thus angle change increased), the amount of overshoot 

' increased. This finding holxls for both groups but is magnified in the Down's 
syndrome group as evident in a group by torque interaction, F (1,132) = 17*27, 
j> < .001 (see Table 1 ). . 

Discussion 



The results of Experiment 1 support the notion that when muscles are 
constrained to act as a un'it in controlling movement about a joint, that unit 
behaves qualitatively like a mass-spring system. The three sets of torque by 
joint angle functions obtained for both normal and Down's syndrome subjects 
were parallel and non- intersecting, and concur with the findings of Asatryan 
. and Fel'dman (1965) % . It may be r.easoned that, for the subjects in this study, 
the -K and 1 0 parameters were established in counterbalancing the external 
force to maintain the specified joint angle. A-sys tematic angle change 
accompanied the systematic torque change. Thus, it appears that the 
parameters of -K and 1 0 remained relatively constant during unloading. 
Furthermore, when the subjects were asked, on different occasions, to reach 
new joint angles-, new zero angles were established. The change in zero length 
resulted in parallel and non- intersecting functions. 

s * 

Of cour:5e the significant result of the present experiment '.s that the 
torque by joint angle functions for Down's syndrome subjects and a normal 
group were practically identical (see Figure 1). It appears therefore that 
the underlying movement organization, at least under static conditions, is 
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FIGURE 2. TRACINGS OF FINGER MOVEMENT 
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basically similar in both populations. 'From a- mass-spring perspective, Down's 
syndrome subjects, like normals, appear capable of ' specifying system 
parameters— stiffness and equilibrium length— that may determine movement at 
the joint. w 

Although the curves appear graphically non-linear, the torque by joint 
angle functions were characterized by a statistically linear trend. Thus , the 
non-linear component in this study is somewhat: less than might be expected 
based on the findings and qualitative analysis of Asatryan and Fel'dman 
(1965). However, these authors' did not subject their obtained : functions to 
any statistical analysis as we have done here. 

One way of interpreting the present findings is that the nervous system 
(in both Down's syndrome and normal' populations) is able to "tune" the muscle- 
joint system by adjusting the length-tension relationships of the muscles 
involved. In the. simple case, agonist-antagbnist pairs can be represented, by 
parallel length-tension curves whose slopes correspond to muscle' stiffness. A 
change in innervation rate to one muscle^ or the other will shift the 
equilibrium -point of; the system (of. Bizzi", 1980;. Cooke, 1980; Fel'dman, 
1966b, 1980a, 1980b; Kelso & Holt, 1980). In this view, supported by the 
ability of deafferented monkeys (Bizzi et al., 19.78; .Bizzi et al., 1976) and 
humans wtthout intact joint and cutaneous reception (Kelso, 1977; Kelso & 
Holt 1980) to localize limbs -accurately, stiffness is set prior to movement 
and 'is a control parameter. On the other hand, Houk (1978) has presented 
evidence in favor of a view in which a combination of muscle spindle and 
tendon proprioceptors provides feedback about muscle stiffness. In this view* 
stiffness is a regulated variable of the system. Regardless of which view one 
adopts both are consonant with the perspective' offered here (but see the 
General Discussion for possible quslif ications on this view, and also Cooke, 
1980 for a model of how mechanical and reflex variables may interact). It is 
the specification of dynamic variables (e.g., stiffness, damping) rather than 
kinematic variables (e.g., displacement, Velocity) that appropriately 
characterize" the neuromuscular 'organization of the muscle- joint system. 

/ 

There were, however, qualitative differences among the movement patterns 
of Down's syndrome and normal subjects. There were- clear differences between 
the graphs in the trajectories toward the target angle, (see Figure 2). 
Moreover, Down's syndrome subjects were less able, after; reaching the target, 
to maintain a steady position. One possible explanation for the Latter 
finding is that once the target was reached, visual guidance was removed. In 
a previous study (Davis & Kelso, Note 1), it was found that Down's syndrome 
' subjects were less able than normal subjects to reproduce movements accurately 
without visual guidance. However, the movements of /Down's syndrome subjects 
in the present study were also less smooth and accurate when visual guidance 
was available. In the present study, movements were made by matching a cursor, 
to a fixed line target on an oscilloscope screen. Visual guidance from an 
oscilloscope may not be the same as direct visual guidance of the finger. 
Nevertheless, under the present conditions of visual guidance tc > a target as 
well as maintaining a set joint angle with visual information absent, Down s 
syndrome sub jects. were not as accurate as their nprmal peers. 

Second, significantly greater overshoot or fluctuation -about the 
equilibrium, point was found in Down's syndrome subjects following unloading. 
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Table 1 



Mean Overshoot (in degrees) for Each Group as a Function of Torque Loading 



X of Torque Unloading 



60% 40% 25% 10% P% -5% -10% 

Mean Sd Mea/ Sd Mean Sd I'ean Sd Mean Sd Mean Sd- Mean Sd 



Experiment 1_ 

" Down 
Syndrome 



0.2 0.3 1.6 1.6 4.2 3.0 9.9 4.3 14.3 5.4 17.1 6.2 18.0 5.2> 



Normal 



0.1 0.1 0.1 0.1 0.7 0.5 1.6 1.1 3.3 1.6' 4.2 ,2.1 5.4 2.2 



Experiment 2 
Down 

Syndrome 0.1 0.1 

Normal 0.2 0.3 



3.5 2.3 
0.9 ^0.5 



9.4 4.9 
1.3 1.1 



12.5 5.5 
1.1 1.9 
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Overshoot may be taken as an index of the damping parameter . For example, an 
underdamped system will fluctuate about the equilibrium position and, for most 
purposes, is unstable. On the other hand, an overdamped system exhibits 
slowed movement speed and no oscillation about the equilibrium point.' A 
critically damped system is one in which the movement will reach the 
equilibrium position in the fastest possible time. Most human muscle joint 
systems appear to be damped just under critical (cf. Kelso & Holt, 1980; 
Neilson & Neilfcon, 1978). Neilson and Neilson (1978), for example, found 
their normal subjects did not excoed five percent of the movement arc in 
overshoot during a rapid voluntary movement. 

The results of our Experiment 1 conform to the above, figure; overshoots 
was found to be Q.W of\ the total movement arc for all normal subjects 
movements. Different amounts of overshoot may be due to the differences in 
the movement . conditions between the present study and that of Neilson and 
Neilson (1978). Overshoot was measured iff the Neilson and Neilson study 
- following a rapid voluntary movement. In our study, however, overshoot W3S 
measured' following unloading during which an active haloing of the limb was 
assumed not to have occurred. It setms reasonable, therefore, to expect some 
difference in overshoot between the normal subjects in our study and those in 
the Neilson and Nelson (1978) study. " Perhaps the more important finding is 
that Down's syndrome subjects, in sharp contrast to normal subject*, appear to ~ 
behave in an underdamped manner, as suggested by the 27. 4* of overshoot found 
f or th i s - group . 

Although the results of Experiment 1 revealed that the torque by joint 
angle functions were near-identical for Down's syndrome and normal subjects, a 
question remains as to whether both populations can alter the slope ox the 
functions (specify stiffness) to a similar degree. Following Asatryan ana 
Fel'dman (1965), one way to examine this question is to, require subjects to 
tense the muscles voluntarily prior to unloading. "Stiffening" the muscle- 
joint system in this manner should reduce the amount of absolute joint^angle 
change, thus increasing the slope of the torque- joint angle functions. In 
Experiment 2, the foregoing strategy was employed to determine whether Down s 
^syndrome Individuals could control stiffness as effectively as normal 
subjects. 



EXPERIMENT 2 



Method 



J 



\ 



The subiects, methods, and procedures in Experiment 2 were the same as in 
Experiment 1 except that subjects were instructed to tense (cocontract) their 
muscles in an effort to maintain the joint angle against the perturbation. 
Each subject was given some practice prior to the experiment proper to ensure 
that the instructions' were understood. In the experimental trials on 
reaching the target angle, subjects were asked to "stiffen their muscles" in 
order to maintain the joint angle. 

Only four separate unloadings (60*. 25%, 0%, and -10*) were used for 
Experiment 2. These four unloadings provided a sufficient description of the 
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FIGURE 3. ANGLE CHANGE AS A FUNCTION 

OF TORQUE CHANGE 
EXPERIMENT 2 
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torque by. joint angle function without inducing undue fatigue. For each of 
the unloadings, at least four trials were carried out and more than 48 trials 
were conducted for each subject. 

Data were graphed as in Experiment 1 and analyzed as before using a 2 
"group) x 3 (series) x 4 (unloadings) analysis of variance with repeated r 
measures on the last two factrrs. 



Results 



Changes in joint angle under "muscle stiffening 11 conditions are shown in 
Figure 3 for normal and Down f s syndrome subjects. Visual inspection of the 
figure suggests some potential difference in the torque- joint angle functions 
of normal and Down's syndrome subjects. Although overall differences between 
the groups just failed to reach significance, F (1,11) = 4.35, £ > .05, there 
was a significant torque effect, F (3.66) = 69.86, £ < .001, and group by 
torque interaction, F (3,66) = 4.61, £ < .01. Inspection of the means shows 
that joint angle increases' as a function of the magnitude of unloading, and 
that this effect is greater for Down's syndrome subject* (see Table 1 r. The 
mean angle changes for each unloading for Down^ syndrome subjects were 1.42, 
12.40. 23.49. and 29.52 deg and 1.29. 8.08, 14.66, and 17.84 deg for normal 
subjects. 

A series by torque interaction was also found, F (3,66) = 3.13, £ < .01. 
As in Experiment '1, at the higher levels of unloading (the last two in 
Experiment 2) the magnitude of angle change increased between series 1 and 2 
and between 2 and 3. 

There were no significant differences between the groups in the standard 
deviation of mean angle change, F (1.11)'- 3.18. £ > .05. However, there was 
a significant torque effect, F (3.66) = 20.62, £ < .001 and a significant 
torque by series interaction. F (6.66) = 4.76. £ < .001. Variability . 
increased as" the magnitude of the unloading increased but was significant only 
in series 1 and 3. 

Representative movement tracings from Down's syndrome and normal subjects 
in Experiment 2 are shown in Figure 4. Qualitatively, the movements of the 
subjects are similar to those in Experiment 1. Down's syndrome subjects 
appeared less capable of reaching and maintaining the target angle (compare 
Figures 2 and 4). 

. Likewise, overshoot or oscillation about tha equilibrium poire following 
unloading was amplified in Down's syndrome subjects (see Table 1) and «as 
revealed in a group main effect, F (1.11) = 8.88. £ < .05. The mai i effects 
of series. F (2.66) = 5.96. .£ < .01 and torque. F (3.66) = 1.82. £ < .001. 
were also significant. Increases in overshoot occurred from series 1 to 2 and 
from 2 to 3. and were also evident as the magnitude of unloading in ;reased. 
Increases in overshoot held for each series in Down's syndrome subj but 
not normals, and resulted in a significant group x series inters Jtion, F 
(2.66) =4.39, p < .05. Significant group x torque, F (3,66) = 8.3, 2 < '°°\' 
and series by torque, F (6,66) = 3.95. £ < .01. interactions were a, so found. 
As the unloading increased, the degree of overshoot increased; this effect was 
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magnified for Down's syndrome subjects and for series 2- and 3 (see Table 1, 
Experiment 2). 

Discussion 

In the second experiment it was demonstrated that Down's syndrome 
subjects were able to increase the stiffness parameter voluntarily when asked 
to tense their muscles against the load change. This capability's shown by 
the increase in slope of the torque by joint angle functions from Experiment 1 
to Experiment 2 (compare Figures 1 and 3). It is well known that an increase 
in activation of motoneurons increases tKe stiffness property of the muscle 
(Agarwal & Gottlieb, 1977; Andreeva & Shafranova, 1975; Barmack, 1976; Houk, 
Singer, & Goldman, 1970; Rack, 1969; Safronov, 1970). An increase in 
stiffness can occur without an increase in force or without a change in joint 
angle. r 

Normal subjects were also capable of increasing stiffness and apparently 
to a somewhat greater extent than Down's syndrome subjects. The mean slopes 
of the functions generated in both experiments were 1.77 and 3.56, 
respectively, for Down's syndrome subjects and 1.83 and 4.67 for normal 
subjects. There was'no statistically significant difference between groups on 
overall angle change in Experiment 2; however, at higher levels of torque 
change, significant differences between groups did exist as revealed in the 
significant group by torque interaction. 

There were also some noteworthy differences among individual subjects 
that warrant some discussion. For example, Down's syndrome subject p's 
stiffness characteristics were closer to that found for normal subjects (see 
Figure 5) and, in fact, exceeded some of the normal subjects. Likewise, one 
normal subject, GA, displayed mustfle stiffness more aligned to the Down's 
syndrome group than to the normal group (see Figure 5). If these extremes are 
excluded from the groups, the difference between Down's syndrome and normal 
subjects is magnified. That wide individual differences among subjects exist 
within both groups is expected. It is known that stiffness varies between and 
within normal individuals (Safronov, 1970). Also, extreme individual 
differences among Down's syndrome subjects on a number of psychological and 
physical variables have been found (James, 1974; LaVeck & Brehm, 1978). It 
can be reasoned that stiffness, as with many other variables, operat3s on a 
continuum rather than strictly dichotomizing the populations observed. 

Finally, there is some indirect evidence that suggests Down's syndrome 
subjects have reduced capacity of muscle activation that might be associated 
with the ability to regulate muscle stiffness. Most Down's syndrome infants 
are deficient in the amino acid 5-hydroxytryptophan (Coleman, 1973; Koch & de 
la Cruz, 1975),' which is though^ to play an important role in neural 
transmission (McCoy, Segal, & Strynadka, 1975) and muscle contraction (Ahlman, 
Grillner, $ Udo, 1971 ).~ The. finding that ; Down' s syndrome subjects have slower 
movement response times (Lange, 1970) may support the idea that Down's 
syndrome subjects are less able to activate their muscles,- If, as some have 
claimed (c:f. Lestienne, Polit, & Bizzi, 1980), integrated electromyography 
reflects active stiffness in muscles, then it seems worthwhile to subject the 
foregoing speculations regarding stiffness to further experimental test. 
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FIGURE 5. ANGLE CHANGE AS A FUNCTION 
§ OF TORQUE CHANGE 

A) DOWN SYNDROME SUBJECT LM B) NORMAL SUBJECT GA 




GENERAL DISCUSSION 

A major finding in this study is that the gross underlying organization 
of motor control in Down's syndrome subjects is qualitatively similar to 
normal subjects and can be described, to a first approximation, in terms of a 
vibratory system. We have found that the torque by joint angle curves, 
described as invariant characteristics of the system by Asatryan and Fel'dman 
(1965) (see also Fel'uman, 1980a f 1980b) f are obtainable from Down's syndrome 
subjects under two -conditions of maintaining a steady joint angle against a 
load. In one condition the subjects did not voluntarily intervene during 
partial unloading. In the other condition the subjects voluntarily tensed 
their muscles, which resulted in an increase in the active stiffness of the 
system. The "importance of obtaining the invariant characteristics (static 
muscle torque versus angle) in this special Down's syndrome population is 
magnified by the recent findings of Fel'dman (1980a f 1980b). With normal 
subjects, Fel'dman has shown that the Invariant Chrracteristic (IC) may 
^characterise the behavior t of muscle-joint systems not only during the 
maintenance of a steady posture (Asatryan & Fel'dman, 1§65) but also during 
rhythmic (Fel'dman, 1980a) and discrete movements (Fel'dman, ,1980b).- For 
example, in order to maintain a steady angle against changing loads, the 
system need onjLy shift from one IC to another. Referring to Figure 1, it can 
be seen that in order to maintain a 150 deg joint angle against a 60* load, 
the ic\f series *1*is used. In order to maintain the 150 deg angle when the 
load changes to 25X, a shift from the IC of series 1.to the IC of series 2 is 
required. x This transition from one IC to another appears to be effected 
through a change of the threshold angle at which motor units are recruited 
(Fel'dman, 1966a, 1980b; see also Crago, Houk, &• Hasan, 1976). 

By the same token, movements may be accomplished by shifts along the form 
of the invariant characteristics, that is, by shifts of the equilibrium point 
of the muscle load system and by changes in the form of the IC (Fel'dman, 
1974a, 1974b). The latter is shown by the set of ICs obtained in Experiment 1 
(Figure 1) and the set obtained in Experiment 2 (Figure 3). Through 
cocontraction of the antagonist muscles, the stiffness of the muscle load 
system may be increased and this is associated with increases in slope of the 
IC. It may be assumed that during movement, transformation from one set of 
curves to another is possible (Fel'dman M 1980a, 1980b). Thus, movements may 
be achieved /through simple changes in the parameters of the muscle-load 
system. According' to this view, amplitude of movement (position) may be 
regulated through changes in zero length, and velocity and acceleration 
through changes in stiffness (cf. Kelso & Holt, 1980). 

A -cautionary point worth emphasizing here is that the majority of 
experiments (including ours), and their consequent interpretation, deal with 
movements in a very restrained environment (e.g., sitting down with shoulder 
or wrist position fixed). New data reveal that the pattern of stiffness 
changes at a joint (s,ay, the arm) is mutable, depending in a significant way 
on the postural status of the subject (Nashnef, Note 2). Some modification 
(or even rejection) of the type of model proposed here for rather fixed 
actions at a joint may well be in order when more real-life situations are 
examined (e.g., a standing subject supported to varying degrees). 
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Although there appear to be overall similarities in the gross 
organization of motor control of Down's syndrome and normal subjects, there 
are notable differences in the precision with which Down's syndrome subjects 
attain target positions (see Figures 2 and *0. As shown by individual 
movement tracings and subsequent analysis, the movement patterns were 
qualitatively different between Down's syndrome and normal subjects in both 
Experiments 1 and 2. The findings here are consistent with other studies that 
show Down's syndrome subjects to be less accurate in controlling movements 
than their normal peers (e.g., Davis 4 Kelso, Note 1). 

Relatedly, and perhaps most important, Down's syndrome subjects differed 
from their normal peers in oscillatory behavior about the newly established 
equilibrium position ( i.e. , following unloading, see Figures 2 and U) . 
Oscillatory behavior is taken as an index of the damping parameter. As 
previously noted, the finding that underdamping characterizes the muscle- joint 
system of Down's syndrome subjects is consistent with the finding'that these 
subjects are less accurate in movement than normal subjects. It is the 
underdamping characteristic, in addition to stiffness, that may distinguish 
Down's syndrome from normal subjects. In this regard, it is important to note 
that although individual subjects GA (normal) and LM (Down's syndrome) were 
unlike their respective groups with respect to stiffness, they did not deviate 
fror group performance cn measures of damping. Mean overshoot for GA was 2.56 
deg compared to the overall mean of 2.2 for the normal group. Likewise the 
means for LM and the Down's syndrome group were 9.35 and 9o1. respectively. 
It is not known whether Down's syndrome subjects are able to modify the 
damping parameter r.hrough training. Other studies have suggested that nQrmal 
subjects can be trained to regulate damping during voluntary movement (e.g., 
Neilson & Lance, 1978). But further investigations are needed to determine to 
what extent Down's syndrome' subjects have this capacity. 

Overall, a number of findings concerning the motor control of Down's 
syndrome subjeccs are provided in this study and several avenues of research 
are suggested. We founcL-that the Down's syndrome subjects in this study' are 
not readily distinguishable from normal subjects in terms of gross movement 
organization; systematic torque by joint angle functions were obtained for 
both groups. Apparently, muscles are constrained to act as a unit in both 
normal and Down's syndrome subjects and this unit exhibits behavior — to a* 
first approximation—qualitatively similar to a mass-spring system (cf. Kelso, 
Holt, Kugler, 4 Turvey, 1980, for review). Where the groups differ, however, 
is in . the specification of stiffness, especially at high values of torque 
unloading and in the damping characteristic. It is interesting in this regard 
that a recent analog model — similar to the one under consideration here — 
characterizes hypotonia in terms of decreases in. resting stiffness (Cooke, 
1980). If our interpretation is reasonable, we may advance the hypothesis 
that it is a deficiency in setting damping and stiffness parameters that best 
characterizes the motor behavior of people with Down's syndrome — at least in 
simple, discrete movements. This view promotes a trend away from more 
descriptive terms, like hypotonia, that have been used up to now. 



lib 

140 



REFERENCE NOTES 



•7* 



1. Davis, W. E. & Kelso, J* A* S. Limb localization in Down 1 s syndrome 
subjects . e Unpublished study, Haskins Laboratories, 1979.. 

2. Nashner, t. M. Central and peripheral components of a mode? for the 
coordination of balance . Paper presented at the Sloane Fqundation Confer- 
ence on Coordination of Action, University .of California', San Diego, 
February, 1981. 

■ * 

REFERENCES 

Agar.wal, G. C, & Gottlieb, G* L. Oscillation of the human ankle joint in 

response to applied sinusoidal^ torque on the foot. Journal < of 

Physiology , 1977, 268, 151-176. ^ 
Ahlman, H., Grillner, S., & Udo, M. The effect of 5-HTP on the static 

fusimotor activity and the tonic stretch reflex af an extensor muscle. 
v Brain Research , 1971, 27, 393-396. 
Andreeva, E. A., & Shafranova, E. I. Muscular activity control* systems for 

Par kinson's^ disease. Automation and Remote Control , 1975, 36, 596-605. 
Asatryan, D. G., & Fel'dman, A. G. ' Functional tuning of the 'nervous system- 

with control of movement or maintenance of a steady posture I. 

Mechanographic analysis * on- the work of the joint on execution of a 

postural task. Biophysics ; 1965, 20, 925-935. 
Barmack, N. H. Measurements of stiffness pf extraocular muscle of the rabbit. 

Journal of Neurophysiology , 1976, 39, 1009-1019. 
Berkson, G. An analysis „of reaction time in normal and mentally deficient. 

young men. III. Variation of stimulus and of response complexity. 

Journal of Mental Deficiency Research , 1960,. 4, 69-77- 
Bizzi; E. Central • and peripheral mechanisms in motor control. In 

G. E. Stelmach & J., Requin (Eds.), Tutorials in motor behavior . 

Amsterdam: North-Holland Publishing Company 1980. 
Bizzi, E., 'Dev, P., 1 Morasso, P., & Polit, A. Effects of load disturbance 

during centrally initiated movements. Journal of Neurophysiology , 1978, 

41_, 542-556. 

Bizzi, E.,- Poiit, A., & Morasso, P v Mechanisms underlying achievement of 
final head position. Journal of Neurophysiology , 1976, 39, 435-4W. 

Brooks, V. B. Some examples of programmed limb movements. Brain Research , 
1974, 71, 299-308. 

Coleman, M. Serotonin in Down syndrome . New York: American Elsevier 

Publishing Company, 1973. 
Cooke, J. D. - The organization of simple, skilled movements. In 
~G. E. Stelmach & J. Requin (Eds.), Tutorials in motor behavior . 

Amsterdam: North-Holland Publishing Company, f980. 
Cowie, V. A. A study of the early development of Mongols . Oxford: Pergamon 

Press, 1970. 

Crago, P. E. , Houk, J. C, & Hasan, Z. Regulatory action of the human stretch 

reflex. Journal of Neurophysiology , 1^76; 39, 925-935. * 
Fel'dman, A. G. Functional tuning of the nervous system -with control of 

movement or maintenanje of a steady posture-II\ Controllable parameters 

of the muscle. Biophysics , 1966, _H, 766-775. (a) 
Fel^dman, A. G. Functional tuning of the nervous system with control of 

movement* or maintenance of a steady posture-III. Mechanographic analysis 



141 



ERIC 143 



of execution by man of the simplest motor -tasks. Biophysics , 1966, -11 , 
766 7 775. (b) 

Fel'dman, A. G. Changes in the length of the muscle as a consequence of a 
shift in equilibrium in the muscle- load system. Bipphysics , 197^t 19 , 
534-538. (a) 

Fel'dman, A. G. Control in the length of a muscle. Biophysics , 1974, 19 , 
766-711. (b) t 

Fel'dman, A. G. Superposition of motor -programs. I. Rhythmic forearm move- 
ments in man. Neuroscience , 1980, 5,, 81-90. (a) 

Fel'dman, A. G. Superposition of motor programs — II. Rapid forearm flexion, 
in man. Neuroscience , 1980, 5, 91-95. (b) 

Frith, U., & Frith, C. D. Specific motor disabilities in Down syndrome. 
- Journal of Child Psychology and Psychiatry , 1974, 15., 293-301. 

Houk, J. D. . Participation of reflex mechanisms and reaction time processes in 
compensatory adjustments to mechanical disturbances. In , Desmedt 
(Ed.), Cerebral motor control in man : Long loop mechanisms . Basel: 
larger, 1978. 

Houk, J. C, Singer, J. J., & Goldman, M. R. Afn evaluation of length and 
» force feedback to soleus muscle of decerebrate cats . Journal of 
Neurophysiology , 1970, 33, 784-811. 

James, R. J. Multivariate analysis of the walking behavior in 
institutionalized Down*s syndrome males . Unpublished doctoral disserta- 
tion, University of Wisconsin, 1974. 

Kelso, J. A.. S. Motor control mechanisms underlying human movement reproduc- 
tion. Journal of Experimental Psychology , 1977, 8« 429-443. 

Kelso, ( J. A. S., & Holt, K. G. Exploring a vibratory systems analysis of 
human movement production. Journal of Neurophysiology , 1980, 43, 1183- 
1196. 

Kelso, J. .A. S., Holt.^K. G., & Flatt, A. E. The role of proprioception in 

the perception and control of hurnjm movement: Toward a theoretical 

reassessment. Perception & Psychophysic s, 1980, 28, 45-52. 
Kelso, J. A. S., Holt, K. G., Kugler, P. N., & Turvey, M. T. , On the concept 

of coordinative structure as dissipative structure — II. Empirical lines 

of convergence. In G. E„ Stelmach & J. Requin (Eds.), Tutorials " in motor 

behavior . Amsterdam: North Holland, 1980. 
Knight, R. M., Atkinson, B. 7 R., & Hyraan, J. A. Tactual discrimination and 

motor skills in mongoloid and non-mongoloid retarded and normal children. 

American- Journal of Mental Deficiency , 1967. H. 894-900. 
Koch, R., & • de la Cruz, F. F. Down's syndrome (Mongolism) : Research, 

prevention and management . New York: Brunner/Mazel, 1975. 
Lange, E. K. A comparison of the response times of mongoloid children and 

normal children. Unpublished doctoral dissertation, University of New 

Mexico, 1970. 

LaVeck, B. t & Brehm, S. S. Individual variation among children with Down 

syndrome. Mental Retardation- , 1978, Jj6, 135-137. 
Lestienne, F., Polit, 'A. 9 & Bizzi, E. From movement to posture. In 

C. H. Nadeau, K. M. Newell, G. C. Roberts, & H. Halliwell (Eds.), The 

skiilfulness in movement : Theory and applic a tion . Champaign, 111.: 

Human Kinetics Press, 1980*. 
McCoy, E. Segal, D. J. , & Strynadka, I. Decreased ATPase/ increased NA+ 

and decreased K+ in Down's syndrome platelets. In R. Koch & F. F. de -a 

Cruz (Eds.), Down's syndrome (Mongolism) : ^ Research, prevention, and 

management . New York: Brunner/Mazel, 1975. 

142 ' ' t ^ 

lot) 



Neilson, P. D. , & Lance, J, W. Reflex transmission characteristics during 
voluntary activity in normal man and patients with movement disorders. 
In J. E. DesmedtT" (Ed.) t ^ Cerebral motor control "in man ; Long loop 
mechanisms : Progress in clinical neurophysiology (Vol. 4). Karger: 
Basel, 1978. . 

Neilson, P. D. , & Neilson, M. D. The role of action reflexes in the damping 
of mechanical oscillations. • Brain Research , 1978, 142 , 439-453. 

Owens, D., Dawson, J. C. , & Losin, S. Alzheimer's disease in Down's syndrome. 
American Journal of Mental Deficiency , 1971, 75, 606-612. 

Polit, A., & Bizzi, E. Processes controlling arm movements in monkeys. 
Science , 1978, 201_, 1235-1237. 

Rack, P. M. H. The significance of mechanical properties of muscle in the 
reflex control of posture. In P. Anderson & A. K. S. Jansen (Eds.), 
Excitatory synaptic mechanisms . Bergen, Tromio: Universitetsforlaget, 

Oslo, 1969. 

Safronov, V. A. Problem of the regulation of muscle tone.- Biophysics , iyfu, 
J15, 1145-1154. 

Schmidt, R. A., & McGown, C. Terminal accuracy of unexpectedly loaded rapid - 
movements: Evidence for a mass-spring mechanism in programming. Journal 
of Motor Behavio r, 1980, V2, 149-161. 

Turvey, M. T. Preliminaries to a theory of action with reference to vision. 
In R. Shaw & J. Bransford (Eds.), Perceiving, acting - and knowing.: Toward 
an ecological psychology . Hillsdale, N.J.: Erlbaum, 1977. 

von Bertalanffy, L. General system theory. London: Penguin University 

Books, 1973. „, ... u . 

Wine, R. L. Statistics for scientists and engineers . Englewooc* Cliffs, N.J.: 

Prentice-Hall, Inc.,, 1964. 

FOOTNOTES 

iNote that equifinality can be disrupted in deafferented but not normal 
animals when the postural relations between atu,aal and apparatus are changed 
(Polit & Bizzi, 1978). When the latter are fixed, however, deafferented 
monkeys and functionally deafferented humans (Kelso & Holt, 1980; Kelso, Holt, 
& Flatt, 1980) exhibit equifinality when the initial position of the limb is 
altered unexpectedly. < 
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PERIODICITY AND AUDITORY MEMORY: A PILOT STUDY 



Janet May and Bruno H. Repp 



Abstract . Band-limited periodic and aperiodic nonspeech stimuli 
varying in center frequency but having similar spectral envelopes 
were presented in same-different tasks at two interstimulus inter- 
vals (.5 and 2 sec). Discriminability decreased as a function of 
interval, but there was no difference betwee/i the types of stimuli, 
suggesting that periodic and aperiodic stimuli are equally well 
retained, in auditory memory . 

Studies oif categorical , perception have sometimes found unexplained 
differences between stimulus sets. For example, May *(1979) tested Egyptian 
listeners 1 discrimination of vowel-fricative-vowel syllables drawn from con- 
-tinua spanning two Arabic fricative contrasts, /jr-V and /x-V, that differ 
from each other only in voicing, having equivalent places of articulation. 
The "subjects in that study found the stimuli from the voiced fricative 
continuum easier to discriminate than those from the voiceless continuum. 
Although the two continua differed along several acoustic dimensions, the most 
conspicuous of these was the periodicity present during the frication portions 
of rthe voiced continuum and absent on the voiceless continuum. The stimuli of 
another recent study of categorical perception (Healy & Repp, 1982) included 
vowels from an /i-I/ continuum and isolated fricative noises from a /s-J/ 
continuum. When- listeners were asked to categorize stimuli from either 
continuum presented in pairs, the contrastive effect exerted by one member of 
the pair on the labeling of the other member was much larger for- vowels than 
for fricative noises, even when differences in discriminability were taken 
into account. Onc\e again, presence versus absence of periodicity seemed the 
most salient feature distinguishing the two types of stimuli, although there 
were several other differences as well. 

At least two Reasons could be proposed why periodicity might make a 
difference in auditory memory. On one hand, it seems possible that the 
recurrence of signal portions with similar structure — i.e., of the successive 
pitch periods— strengthens the auditory trace, in the way repetitions of a 
stimulus generally do. According to this hypothesis, the auditory tra^s of 
aperiodic stimuli would not have the benefit of such reinforcement-by- 
repetition and, therefore, would be «weaker than those of periodic stimuli. On 
the other hand, the repetitive structure of periodic stimuli may also be seen 
as a disadvantage. Aperiodic stimuli, by virtue of their randomly changing 
waveform, may have more idiosyncratic features to be remembered by, and 
therefore may enjoy an advantage in auditory memory. These hypotheses may be 
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called the engraving hypothesis and the redundancy hypothesis, respectively. 
Of course, both hypotheses could easily be wrong, and there may be no 
difference at all between periodic and aperiodic stimuli in memory. In that 
case, the differences between different stimulus sets in the studies cited 
above must have rested on stimulus properties other than periodicity. 

We conducted a pilot study to compare stimuli that differ only in the 
presence versus absence of periodicity and are matched in all other respects. 
For that purpose, we generated band-limited complex nonspeech sounds by 
exciting the second-formant filter of a parallel-resonance speech synthesizer 
with either a periodic or an aperiodic source. The resulting stimuli were 
well matched in spectral shape and amplitude. The center frequency of the 
formant was varied in several steps, and the stimuli were presented in a 
fixed-standard same-different pitch discrimination task at two different 
interstimulus intervals. Differences between the two stimulus sets could 
emerge either as a difference in overall discrimination accuracy or as a 
difference in the effect of increasing the interstimulus interval (i.e., in 
the decay rate of auditory memory). 
-< 

Method 

Subjects . The subjects were paid volunteers recruited by advertisements 
on Yale campus. After a number of them had been tested, it became evident 
that several listeners made hardly any errors. These listeners (4 in all) 
were replaced with new subjects until a total of 12 had been run. 

Stimuli . The stimuli were generated on the Haskins Laboratories parallel 
resonance synthesizer.' Only the second-formant circuit was used . v For the 
periodic stimuli, the "buzz" source was employed, and for the aperiodic 
stimuli, the "hiss" source. The fundamental frequency of 'the buzz was 100 Hz. 
Each set oncained five stimuli with different center frequencies; nominally, 
they were 16H, 1688, 1764, 1840, and 1917 Hz. The nominal bandwidth was 90 
Hz. To control for possible idiosyncrasies of the aperiodic stimuli due to 
the random noise source, three different tokens of each of the five aperiodic 
stimuli were synthesized. Amplitude parameters were set so as to yield equal 
amplitudes for periodic and aperiodic stimuli at output. 

All stimuli were synthesized at a duration of 65 msec. Subsequently, 
they were digitized at 10 kHz using the Haskins Laboratories pulse code 
modulation system. The digitized waveforms were trimmed to a duration of 50 
msec. This was done to eliminate artifacts at stimulus onset produced by 
starting the synthesizer at full amplitude. The periodic stimuli were reduced 
by eliminating the first pitch period and a portion from the end, so that 
exactly 5 complete, equal-amplitude pitch periods remained. The aperiodic 
stimuli were cut at corresponding points. To avoid transients, all cuts were 
made at the nearest zero-crossing. 

Before recording the experimental tapes, the stimuli were analyzed using 
both standard spectrograms and spectral cross-sections generated by a Federal 
Scientific UA-6A spectrum analyzer. The spectral envelopes of periodic and 
aperiodic stimuli with the same center frequency were closely matched. We 
noted that the actual center frequencies did not always match the intended 
ones, but these discrepancies (which may have been due, in part, to inaccuracy 

146 



153 



ERIC 



of the synthesizer and, -in part, to measurement error in spectral analysis) 
essentially left the spacing of the stimuli intact and were equally present in 
both types of stimuli. 

Four tapes were prepared for the experiment. They contained pairs of 
either periodic or aperiodic stimuli at one" of two interstimulus intervals 
(ISIs) between the members of each pair (.5 or 2 sec). The first member of 
each pair was constant; it was always the center stimulus of the continuum 
(1764 Hz). The second stimulus could be any of the five stimuli, with equal 
probability. Thus, on 20 percent of the trials, the two stimuli in a pair 
were identical; on 40 percent, the second stimulus had a higher pitch than the 
first, and on the , remaining 40 percent, it had a lower pitch. There were two 
degrees of pitch difference, depending on whether the comparison stimulus was 
one or two steps removed from the standard. The five possible pairs occurred 
30 times in random order, with 2 sec of silence between pairs for the short 
ISI and 4 sec for the long ISI. There were longer pauses after groups of 30. 
Different tokens of the aperiodic stimuli were employed in successive blocks 
of 5 pairs. 

Procedure . The four stimulus tapes were presented in a single session. 
Their order was counterbalanced across subjects. The tapes were played back 
on an Ampex AG-500 tape recorder, and the subjects listened over TDH-39 
earphones. The task was to write down "s" or "d" for each trial, depending on 
whether or not a difference could be detected between the two members of a 
pair. Each tape was preceded by several practice trials. 

A nalysis . Individual subject scores in the four different conditions 
were "converted into d« values. The proportion of "same" responses to 
identical pairs was taken as the false-alarm rate, and the proportions of 
"different" responses to the four types of nonidentical pairs were taken as 
separate hit rates. The use of d 1 values corrected for differences in 
subjects' tendency to say "same"; however, ttv d* values were confounded with 
possible differences in subjects' criteria for etecting upward and downward 
changes in pitch. Proportions of 0 and 1 were treated as .01 and .99, 
respectively, leading to an upper bound on d' of 4.66. 

A four-way analysis of variance was conducted on the d* values, with the 
factors Stimulus Type (periodic vs. aperiodic), ISI (.5 vs. 2 sec), Direction 
of Pitch Change (up vs. down), and Extent of Pitch Change (1 vs. 2 steps). 

Results 

The main results are .shown in Figure 1a. There we see that discrimina- 
tion performance declined as the ISI was extended from .5 to 2 sec, F( 1 ,11) » 
5 6 p < .05. However, there was no significant difference between the two 
types of stimuli, either in overall performance level or in the extent of the 
decline . 

The effects of the other two factors, direction and extent of pitch 
change, are shown in Figure 1b. As expected, 2-step differences were easier 
to discriminate than 1-step differences, F(l.ll) = 61.5. £ < -001, but there 
was an interaction, F(1,11) = 5.0, £ < .05: 1-step changes were easier to 
detect when pitch went down rather than up, whereas there was no effect of 
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(a) Effect of ISI on memory for periodic and aperiodic stimuli, 
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direction in 2-step changes. This interaction may, in part, derive from a 
ceiling effect for 2-step pairs. The main effect of direction fell short of 
significance, F(1,11) * M.5, J> > .05. Note that better discrimination of 
downward changes^uould be expected on the basis of Weber f s daw. 

One additional effect was significant in the analysis of variance. This 
was t\r» three-way interaction between Stimulus Type, ISI, and Direction of 
Change, F(1 ? 11) = 5.0, j> < .05. The performance decline with ISI was larger 
for downward than for upward changes in aperiodic stimuli, but larger for 
upward than for downward changes in periodic stimuli. The reason for this 
pattern is not clear. - 

Discussion 

Our results suggest that periodic and aperiodic stimuli are about equally 
well retained in auditory memory. However, we cannot prove the null hypo- 
thesis. It is possible that our experimental design did not provide the best 
opportunity for differences between the two types of. stimuli to emerge. At 
least three reasons could be envisioned. First, the stimuli were relatively 
easy to discriminate — so much so that the best subjects had to be replaced. 
Thus, the data derive only from subjects with average or below-average 
discriminatory capabilities — an undesirable state of affairs. Second, the 
stimuli were' rather brief, ,and it may be argued that their duration was too 
short for any specific advantage of periodic stimuli to arise. In other 
words, five pitch periods may not be enough to produce a sufficient amount of 
reinforcement-through-repetition, the process assumed by the "engraving hypo- 
thesis." Third, by using a fixed-standard paradigm, we may have reduced the 
subjects 1 reliance on auditory memory. In principle, subjects could have 
adopted the strategy of ignoring the standard altogether (i.e., by relying on 
a long-term memory representation of it) and bf arriving at a "same-different" 
decision by an absolute judgment of the comparison stimulus. That this 
strategy was not used exclusively is suggested v by the significant effect of 
increasing the IS!. Nevertheless, it may well be true that we did not force 
the listeners sufficiently to rely on auditory memory. 

Thus, our pilot study did not put the engraving/redundancy hypotheses to 
a very strong test, and further research will be necessary to decide whether 
either of them has any validity. However, before further experiments are - 
undertaken, we should perhaps wait for a more compelling reason to expect any 
effect of oeriodicity on auditory memory. . The present results have not 
increased our confidence that sdch effects exist. 
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READING SKILL AND LANGUAGE SKILL 
Virginia A* Mann+ 



Abstract . To learn to read is to acquire a visual language skill 
that systematically maps onto extant spoken language skills. Some 
children perform this task quite adeptly, while others encounter 
much difficulty, and it has become a question of both scientific and 
practical merit to ask why there exists such a range of success in 
learning to read. Obviously, learning to read places a complex 
burden on many emerging capacities, and In principle, at least, 
reading disability could arise at any level from genera" cognition 
to visual perception. Yet since reading is parasitic on spoken 
language, the possibility also exists that reading disability xs 
derived from some subtle difficulty in the language domain . In this 
article, my intent is to review some of the many studies that have 
explored the association between reading skill and spoken language 
skill. these reveal that when certain spoken-language skills of 
good and poor beginning readers are critically examined, consider- 
ably many, though perhaps not all, poor readers prove to possess 
subtle deficiencies that correlate with their problems in learning 
to read. 



LINGUISTIC SHORT-TERM MEMORY DIFFERENTIATES 
GOOD AND POOR B EGINNING HEADER S 

____ i i — ■ ii — I www i r >. 

One of the more compelling reasons to view reading defi6iency as the 
derivative of a language deficiency is that success at learning to read is 
associated with the adequacy of certain linguistic short-term memory skills. 
In our work at Haskins Laboratories, my colleagues and I have found clear 
indications of this association in a variety^of different studies of good and 
poor beginning readers. For the moment, however, let me put aside a 
discussion of those studies in order to consider first the short-term storage 
requirements of normal language processing, and to summarize some recent 
findings as to how these requirements are met by the mature language us'er . 
These considerations pertain to both. written and spoken language and provide a 
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necessary introduction to any discussion of linguistic short-term memory among 
.beginning readers. 



An adequate short-term memory is essential to language comprehension 
simply because the component words of a phrase or sentence must often be h6ld 
temporarily, pending extraction of the meaning of the whole phrase or sentence 
(Baddeley, 1978). It is for precisely this reason that many current models of 
sentence processing explicitly include some form of short-term memory buffer 
as a part of their parsing device (cf, Frazier & Fodor, 1978; Kimball, 1975; 
Marcus, 1980). Some consideration has been given to the 'form of memory 
representation that mediates human parsing. Current psychological theory has 
it that some level of phonetic representation is likely to be involved, this 
being an abstract representation of the articulatory gestures that constitute 
the material being parsed (Liberman, Mattingly, & TurVey, 1972). There are 
many experimental findings to corroborate this view. On the one hand, adult 
subjects have given evidence of relying on phonetic representation while 
performing such ecologically invalid tasks as recalling a string of ^etters or 
a string of words (Conrad, 1964; Drewnowski, 1980). More importantly, there 
is evidence that phonetic representation is also involved during comprehension 
of both written and spoken sentences (cf. Baddeley, 1978; Daneman & Carpenter, 
1980; Kleiman, 1975; Levy, 1977; Slowiaczek & Clifton, 1980; Tzeng, Hung, & 
Wang, 1977) v 



It is, of course, noy inconceivable that, in reading, some nonlingui^tic 
representation of written words might be employed in lieu of a phonetic one 
(cf; Kleiman, 1975; Meyer, Schvaneveldt , & Ruddy, 1974). There is, after all, 
much evidence to suggest that access to the mental lexicon for printed words 
may not necessarily require reliance on phonetic representation (cf. Baron, 
1973; Kleiman, 1975; Meyer et al., 1974). Nonetheless, it is important to 
emphasize that reading typically involves more than mere lexical access alone. 
A successful reader must often go beyond the lexicon and place reliance on the 
grammatical structure of the material being read. In contrast to experiments 
involving lexical access, those experiments concerned with reading situations 
where, sentence structure is at stake have consistently given evidence of the 
involvement of phonetic representation (Daneman & Carpenter, 1980; Kleiman, 
1975; Levy, 1977; Slowiaczek & Clifton, 1980). Even readers of Chinese 
logography, an orthograpny in which access to the lexicon is necessarily 
mediated by non-phonetic representation, appear tc make use of phonetic 
representation when their task involves recovering the meaning of written 
sentences and not simply words alone (Tzeng et al . t 1977). 

For adult subjects, phonetio representation is clearly involved in both 
written and oral language comprehension . Having made this point , let me 
return to the primary concern of this paper, which is a review of some recent 
studies of good and poor beginning readers. Ttfese provide another form of 
support for the involvement of phonetic representation in all language 
processing, by revealing that effective use of phonetic representation is 
associated with, and may even presage success in, learning to read. I intend 
to review some of the many findings that support this conclusion; however, it 
might be useful first to provide some basic information about the population 
of beginning readers 4 whom my colleagues and I have studied, since they have 
provided much of the data to which I will refer. 
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Most frequently our subjects have been first, second, and third graders 
who attend public schools* All of them are native speakers of English who 
suffer froirf no known neurological impairment ♦ They are identified by their 
teachers as being "good," "average/ 1 or "poor" readers, a status that we 
confirm by administering standard reading tests to each child (typically the 
Word Attack and Word Recognition Subtests of the Woodcock Reading Mastery 
Tests, Woodcock, 1973; or the Word Recognition Subtests of the Wide Range 
Achievement Test, Jastak, Bijou, & Jastak, 1965) . Administration of these 
tests has typically revealed the "good" readers' to be reading at a level one 
or more years above their grade placement, whereas the "average" readers are 
reading at a level between one year above and one-half year below placement. 
The "poor" readers tend to be reading at a level one-half year or more below 
grade placement. Aside from administering standard reading tests, we have 
also usually given our subjects intelligence tests (either the Peabody Picture 
Vocabulary Test, Dunn, 1959; or the Slosson Intelligence Test for children, 
Slosson, 1963; or ' the WISC-R), and have excluded those children in either 
reading group who score below 90 or above HI5. 
* <, 

One of the more general findings to emerge from our work is that good and 
poor readers may differ in temporary memory for some types of material, but 
not for other types (Katz\ Shankweiler, & Liberman, in press; Liberman, Mann, 
Shankweiler, & Werfelman, in press; Mann & Liberman, in press). An example of 
this trend may be seen in the results of a study that assessed recognition 
memory skill among good and poor beginning readers (Liberxnan et al . , in 
press). The subjects were second graders who differed in reading ability, but 
not In mean age or nean IQ. They participated in an experiment that employed 
the recurring recognition memory paradigm of Kimura (1963) as a means of 
evaluating memory for several different types of material. The material we 
studied included two tion-linguistic materials—photographs Of unfamiliar faces 
and nonsense "doodle" drawings— and one linguistic material—printed nonsense 
syllables. For each of these, the children inspected a set of * stimuli and 
proceeded to indicate any of the inspection items that recurred in a 
subsequent recognition set. As may be seen in Figure 1, the poor readers were 
equivalent to the good readers in memory for faces and even somewhat better 
than K the good readers (although not significantly bo) in memory for the 
nonsense drawings. However, they were significantly inferior to good readers 
in memory for the -nonsense syllables. Thus there is an interaction between 
reading ability and the type of item being remembered;* an interaction that 
prevailed in an analysis of covariance adjusting for any effects of age or IQ 
differences . * 

Clearly, this experiment cannot support a conclusion that poor readers 
suffer from some general memory difficulty. Rather, they appear deficient 
only in the ability to remember linguistic material. Many findings that 
concern short-term memory lend further support to this conclusion. Good 
readers typically surpass poor readers in short-terrn memory for printed 
strings of letters or printed words (cf. Shankweiler, Liberman, Mark, Fowler, 
& Fischer, 1979;' Mark, Shankweiler, Liberman, & Fowler, 1977) as well as for 
printed nonsense syllables. However, good readers also excel at recall of 
spoken strings of letters (Shankweiler et al., 1979), spoken strings of words 
(Bauer, 1977; Byrne & Shea, 1979; Katz & Deutsch, 1961; Mann, Liberman, & 
Shankweiler, 1980; Mann & Liberman, in press), and even spoken sentences (Mann 
et al., 1980; Perfetti & Goldman, 1976; Wiig & Roach, 1975; Weinstein & 
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Rabinovitch, 1971). At this point it is important to note that, since the 
advantage of good readers holds for both written and spoken material, it must 
extend beyond processes involved in reading, as such, to the broader realm of 
language processing. 

To account for the linguistic memory distinctions between good and poor 
readers, some of my colleagues (Liberman & Shankweiler, 1979; Shankweiler et 
al., 1979) offered the hypothesis that poor readers have some difficulty that 
specifically compromises effective use of phonetic representation. Therefore, 
they used a modification of Conrad's ( 1 964 ) procedure for examining the 
involvement of phonetic representation in memory for written letter strings, 
to test a group of good, average, and poor readers from a second-grade 
population that was homogeneous with respect to age and IQ. As was the case 
in Conrad's procedure, the children were asked to recall strings of five 
consonants that were of two basic types ./""Hal f of the strings were composed of 
consonants with phonetically confusable (i.e., rhyming) names, - whereas the 
other half contained letters with' phonetically nonconfusable (i.e., nonrhym r 
ing) names. During testing, the children saw a letter string with all of its 
letters printed in upper case on a single line in the center of the visual 
field. After a three-sec inspection period, when the letters could no longer v-. 
be seen, they wrote down any letters that could be remembered, preserving the 
sequence as closely as possible. 

On the basis of Conrad's findings, Liberman, Shankweiler, and their 
colleagues predicted that nonrhyming letter names would generate fewer phonet- 
ic confusions than rhyming ones, and thus facilitate recall in subjects who 
rely on phonetic representation as a means of retaining letters in short-term 
memory.. It was felt that if a subject's level of performance failed to profit 
from reduced phonetic confusability , then that subject might have made less 
effective use of phonetic representation as a mnemonic device. The perfor- 
mance of good, average, and poor readers on the two types of letter strings is 
compared in the top section of Figure 2. Good readers, in general, made fewer 
errors than poor readers, and the average readers fell in betweeri. The 
performance of the'good readers, however, was also more significantly affected 
by the manipulation of rhyme than was that of the average or poor readers. In 
fact, tne advantage of the superior readers was virtually eliminated when the 
letter strings contained letters with phonetically confusable names. In other 
words, phonetic confusability penalized the better readers to a greater extent 
than children in the other two reading groups. 

These findings were extended by two subsequent experiments involving the 
same group of subjects and the same set of letter strings. In the first of 
these, the letters of each string were presented visually, but successively 
rather than simultaneously. In the second experiment, the letters were 
presented successively, but auditorily rather than visually. The results of 
these experiments are also displayed in Figure 2, where it may be seen that, 
once again, the interaction betwen reading ability and the effect of phonetic 
confusability was upheld. Indeed, it prevailed even when the letters were 
heard instead of seen. It is important to underscore the fact that reading 
ability was the only variable that interacted with the effect of phonetic 
confusability on letter recall. The children with higher IQ scores did tend 
to perform at a higher level than those with lower scores; however, the extent 
of their superiority was the same regardless of whether the comparison 
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involved phonetically confusable letter strings or phonetically nonconfusable 
ones. P^s. the interaction between reading ability • and the effect of 
phoiUti<\c<5nfusability was unaltered when the analysis of the data covaried 
for any effects of IQ. 

To strengthen these- findings about poor readers' ineffective use of 
phonetic representation, my colleagues and I followed the. study of letters- 
string recall with a study of the role of, phonetic representation in recall of 
other, more ecologically valid material such as spoken word strings and spoken-" 
sentences (Mann et al . , 1980). In that study, the subjects were again good 
and poor readers from a second-grade classroom. This time ,. however , the good 
readers had a slightly higher mean IQ than the poor readers. The experiments- 
involved having the children in each group repeat strings of five spoken 
words, and also the words of 13-word sentences that were either meaningful or 
semantically anomalous. The material's included many different items of -each 
type, but. for word strings and both types of sentences, half of the items 
contained a high density of phonetically confusable (i.e., rhyming) words, 
whereas naif contained phonetically nonconfusable words instead. Children's 
performance on the word strings is compared in Figure 3. and that on sentences- 
is compared in Figure q. As can be seen .in- those figures, for word strings, 
as -well as for both meaningful and semantically anomalous sentences, good 
readers made fewer errors than poor' readers as lo % ng as the. material was 
-phonetically nonconfusable. For all three types of material, however, they, 
fell to the level of the poor readers when the material contained a high 
density of phonetically confusable words.' In this experiment ,. although good 
readers tended to have Higher IQ's, a significant interaction between reading 
ability and the effect of phonetic' confusabil ity was obtained when -the results 
were subjected to an analysis of c'ovariance that adjusted^ for any difference* 
in IQ.. 0*ce again, intelligence alone was not the source of the good readers 
more effective use of phonetic representation. t ' 

Tnus, whether the material is -apprehended by ear or by eye, and wnether 
it involves letter strings or meaningful sentences, the performance of good 
readers tends to be both ' superior to that of poor readers a*d also more 
strongly affected by manipulations of phonetic confusabil fty. For most good 
-readers, as- for most' ^dult-6, phonetic confusabil ity ■ of the material to be 
recalled makes reliancj on phonetic representation a liability rather than an 
asset. In contra^Tphonetic confusability has little effect on the memory 
performance of most poor readers, a fat* that we interpret as evidence that 
they are, for some reason, encountering difficulty with phonetic representa- 
tion. . ~ 

. CLARIFYING THE ftASIS OF. POOR REjVDE_RS_]_ PROBLEMS 
WITH LINGUI3TIC SHORT-TERM MEMORY ■ 

At this point, it becomes ' appropriate" to consider vtfiy good and poor 
readers might differ in performance on tasks that involve reliance on phonetic _ 
representation. We can lay aside the possibility that memorial representa- , 
tion, in general, is a problem, since if this .were so, poor readers would have - 
been inferior on other tests of temporary memory and not merely on those that 
involve reliance on phonetic representation. A' general cognitive deficiency 
would also seem an unlikely basis, given our findings that IQ scores are not 
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Figure 3. Mean error scores of good and poor readers on recall of word 
strings , in nonrhyming and rhyming conditions. (Maximum = 50 
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significantly associated with sensitivity to manipulations of phonetic, confu- 
sability. Two other possibilities seem more plausible. On the one hand, poor 
readers might not resort to phonetic representation at all, relying instead on 
visual or semantic modes of representation. However, it is likewise possible 
that they do attempt to employ phonetic representation, but for some reason 
their representations\are less effective. 

One piece of evidence that is relevant to this issue is provided by the 
results of an experiment in which I extended Liberman and Shankweiler ' s study 
of letter string memot y to a population of second- and third-grade children 
who were learning to read Dutch. The subjects were the ten best readers and 
the ten worst readers in each grade; their mean ages and reading abilities are 
given in Table 1. The procedure was the same as in the first experiment of 



Table 1 



Age and Reading Ability Among Beginning Readers of Dutch 



Mean Age Mean Grade-equivalent 

in Years Reading Ability* 



Second Graders: 



good readers 7.4 1 

poor readers 7.3 1 • 1 



Third Graders: 
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good readers 8.2 5.5 

poor readers 8.5 2.3 



"Grade-equivalent scores measured by the Ein-Minuten Test; Berkhout, 1972. 



Shankweiler et al . (1979) with one innovation. In constructing the le,tter 
strings I separately manipulated phonetic and visual confusability , since 
this was more feasible in Dutch than in English. Thus it was possible to 
examine recall of three different types of upper-case consonant strings: 
strings of letters that were phonetically confusable but not visually 
confusable; strings of letters that were visually confusable but not phoneti- 
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cally confusable', and strings of letters that were minimally confusable along 
both the visual and phonetic dimension. .In all cases, the measure of phonetic 
confusability was the density of letters with rhyming names, since that 
measure had been employed by the Conrad (1964) study on which the Shankweiler 
et al. (1979) study had been based. The measure of visual confusability was 
derived from the upper-case letter confusion matrix compiled by Townsend 
(1971), and was the summed probability of visual confusion for each possible 
pair of letters in a given string. Computed in this way, the mean confusabil- 
ity for the ten visually confusable strings was 0.81, and was significantly 
greater than that for either the ten phonetically confusable or the ten 
minimally confusable strings (0.27 and 0.31, respectively, td8) = 3.1, P<.01, 
and t(18)=2.8, p< . 01 , respectively). 

As no children's IQ test was available in Dutch, I controlled for 
nonlinguistic short-term memory rather than for general intellectual ability. 
The test of nonlinguistic memory that I administered was the Cor si test 
(Corsi, 1972). The materials for that test consist of a set of nine wooden 
cubes attached in a random fashion to a flat wooden base. The entire 
apparatus is painted black; there are identifying numbers on the rear surface 
of the cubes that can be seen by the experimenter although not by the subject. 
During testing, the subject watches the examiner tap out a sequence of blocks 
and then attempts to reproduce that sequence. Practice sequences of two and 
three blocks are given first, followed by eight test sequences of four and 
eight of five blocks each. The suitability of this test as a measure of 
nonlinguistic short-term memory is indicated by clinical studies revealing 
that whereas performance on linguistic short-term memory tests is selectively 
impaired by damage to the left or language-dominant hemisphere, that on the 
Corsi blocks shows the opposite pattern of selective impairment as a conse- 
quence of damage to the right, or language-nondominant hemisphere (Corsi, 
1972; Milner, 1972). 

Because of my experience with American children, which had revealed no 
significant relation between reading ability and non-linguistic memory, I did 
not anticipate finding that good and poor beginning readers of Dutch would 
differ in performance on the Corsi test. There seemed to be no reason to 
anticipate that children in the two reading groups would differ in nonlinguis- 
tic abilities. It did seem possible, however, that poor readers would do less 
well than good readers on the letter-string memory test, and that they might 
also be differently affected by the manipulations of phonetic and visual 
confusability. Proceeding from the fact that phonetic confusability penalizes 
recall in subjects who rely on phonetic representation, I speculated that if 
poor readers rely on visual representation, then they might be inordinately 
affected by the manipulation of visual confusability. 

The results of the study are given in Table 2. where all memory test 
scores are error scores that include errors of item omission and substitution, 
as well as of incorrect order. In that table, it may be seen that despite any 
differences in the Dutch and English languages or in the educational practices 
by which they are taught, the memory profiles of good and poor readers in the 
two countries prove quite similar. As we have found to be the case for 
American children-. Dutch children who are poor readers are equivalent to good 
readers in performance on the nonlinguistic short-term memory test: 
F(1,39)=1.6, p>.10, although older children tend to do better than younger 
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ones: F( 1 f 39)=4.9, P<.05. Older children also tended to do better on the 
letter string test: F( 1 , 39)= 1 1 . 8, p<.005. More importantly, the good begin- 
ning readers of Dutch tended to surpass the poor readers in memory for 
consonant strings, and they did so at both age levels tested: F( 1 , 39)=45. 0, 
p<.0001. 



Table 2 

Results Obtained with Beginning Readers of uutch 



CORSI LETTER STRINGS 

BLOCKS Phonetically Visually Non- 

Conf usable Conf usable conf usable 



Second grade: 

good readers 20.5 15.4 23.4 16.4 

poor readers 24.5 29.8 29.8 30.9 

Third grade: 

good readers 15.9 11.0 20.7 8.5 

poor readers 18.2 24.5 26.9 26.2 



(Max. = 72) (Max. = 50) (Max. = 50) (Max. = 50) 



In these data, there is, further, the anticipated interaction between 
reading ability and the effect of the various manipulations of letter 
confusability: F(2,72)=28. 3, p<.0001 . The better readers surpassed the 
poorer ones in memory for the minimally confusable letter strings, this being 
true for both second: t(18)=4.8, p<.001,' and third graders: tO8) = 10.3, 
p<.001. However, the good readers at both ages fell to the level of poor 
readers when they attempted to recall phonetically confusable strings. A 
further twist to these data involves the effect of visual confusability, or 
rather, its non-effect. Neither good nor poor readers were affected by the 
presence of a higher density of visual confusability. That is to say, for 
both groups of subjects at both age levels, performance on the visually 
confusable strings was no different from that on the nonconfusable ones. This 
gives u:; no reason to believe that in this task the poor readers opted for a 
purely visual representation of the letter strings. Either they relied on 
some as yet undetermined form of representation, or they relied on phonetic 
representation and for some reason failed to profit from reduced phonetic 
similarity a.nong the letter names. 
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Some direct evidence in support of the possibility that poor readers do 
sometimes rely on phonetic representation may be found in the pattern of 
errors these children make when they attempt to recall a phonetically 
confusable string of spoken words. Some of my colleagues and I recently 
analyzed the responses made by good and poor readers who were attempting to 
recall such a string (Brady, Shankweiler, & Mann, 1982). Hie subjects were 
participating in an experiment that will be described in more detail below; 
they were good and poor readers from a third-grade classroom and they did not 
significantly differ in IQ. They were asked to repeat strings of five words 
that were either phonetically confusable or phonetically nonconfusable . As in 
the past, the good readers tended to excel with respect to the poor readers, 
but also tended to be more greatly affected by the manipulation of phonetic 
confusability. We also found, however, that although children in both reading 
groups made many substitution errors, the poor readers tended to make more of 
these than the good readers. We therefore turned to analyzing the composition 
of the substitution errors and their relation to the words of the original 
string. 

Our analysis revealed that the pattern of substitution errors was the 
same for good and poor readers alike. Almost no substitutions were semantic 
associates of the words in the string being recalled; instead, the majority 
were composed of a subset of the phonemes that had constituted the words of 
the string being remembered. For example, a great proportion of the errors 
contained an appropriate initial consonant and even more contained an appro- 
priate vowel or final consonant. Thus it seemed as if the children in both 
reading groups had remembered many of the phonemes they had heard. The poor 
readers, for some reason, had merely made more errors in recalling the 
original word strings, perhaps because their phonetic representations were 
less wel] formed, or perhaps because their representations decayed more 
rapidly t.nan those of the good readers. 

Thus, in at least some circumstances, it seems that poor readers may rely 
on phonetic representation to some extent; otherwir-8 they would not have 
tended to make substitution errors that preserve phonetic aspects of the 
original word string. Before leaving this topic, it would be pertinent to 
mention the possibilit tnat problems with phonetic representation may lorce 
the poor readers to rely on ^mantic representation during certain memory 
tasks. Although my colleagues and I have seen almost no semantically-based 
substitution errors among either good or poor readers, this has not been the 
case in another study done by > Byrne and Shea (1979). These investigators 
compared the performance of good and poor beginning readers on a spoken-word 
recognition memory test, and found that, in general, good readers performed at 
a higher level than poor readers. They also discovered that children in the 
two groups tended to make different types of errors. Whereas poor readers 
made proportionately more false recognition errors on semantic associates of 
the correct items, good readers tended to make more such errors on words that 
were phonetic associates. For example, when asked to remember and subsequent- 
ly recognize "home," poor readers tended erroneously to recognise "house," but 
good readers, "comb." Yet when the task was to remember nonsense syllables 
instead of words, children in both reading groups made many errors on phonetic 
foils Once again, however, good readers somehow made more effective use of 
phonetic -epresentation, as evidenced by their tendency to make fewer errors, 
in generaL, coupled with their tendency to make disproportionately many errors 
on phonetically-similar foils. 



Turning now to the question of why the phonetic representations of' poor 
readers may be 3 ess effective than those of good readers, let me return to the 
above-mentioned study by Brady et al . (1982). In that study an approach to 
the problem of phonetic representation was inspired by the finding that, when 
speech perception is stressed by the presence of background noise, short-term 
memory span is inordinately affected (Rabbitt, 1968). This finding led us to 
consider the possibility that the short-term memory difficulties of poor 
readers might be associated with some difficulties in encoding speech. 
Therefore, we designed an experiment to compare the ability of good and poor 
readers to identify spoken words that were partially masked by white noise. 

The third graders who were subjects of this study did not differ in age 
or IQ, but did differ in reading ability, and also in menory for strings of 
spoken words. Their performance showed the usual interaction between reading 
ability and the effect of phonetic confusabil ity. They were asked to identify 
a pre-recorded set of spoken words that contained an equal number of high and 
low frequency words and was balanced for phonetic constituents and syllabic 
structure. Each child heard the words under two different conditions: first 
partially masked by signal-correlated white noise, and later under more 
optimal listening conditions. 

The results revealed that although the poor readers were not significant- 
ly different from good readers in performance under the optimal conditions, 
they made about 35% more errors when the words were partially masked. That 
this problem could not be attributed to some basic vocabulary deficiency could 
be seen from the fact that differences between children in the two reading 
groups obtained equally for high and low frequency words, and also from the 
fact that the subjects of our study had performed at the same level on the 
Peabody Picture Vocabulary Test (Dunn, 1959). It is also consistent with this 
observation that an interaction between reading ability and the effect of 
partial-masking was obtained with an analysis that covaried for the effects of 
age and IQ. 

To determine whether the findings of this experiment were specific to 
speech perception , as opposed to being an attribute of general auditory 
perception, we conducted a second experiment. In it, the same subjects were 
asked to identify a set of environmental sounds taken from a standard clinical 
test, including such sounds as a cat meowing and a door slamming. The 
procedure was analagous to that in the previous experiment with spoken words; 
the subjects first identified the sound when partially masked by white noise, 
and later when presented under more optimal listening conditions. The pattern 
o f results for this second ex periment proved distinct from that obtained in 
the first one. Many of the poor readers were actually better than the good 
readers at identifying the partially-masked sounds, although this difference 
is not significant. An analysis of covariance that adjusted for age and IQ 
effects reveals that, although the noise penalized the overall level of 
performance, there was neither an effect of reading ability nor an interaction 
between reading ability and the penalizing effects of the noise masking. 

Thu:> it would appear that any deficiency in auditory perception on the 
part of the poor readers is limited to the realm of speech perception . 
Although more research is needed to clarify the relation between this speech 
perception deficiency and poor readers 1 problems with phonetic representation, 
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the fact of its existence is certainly provocative and most pertinent to the 
view that reading skill is associated with language skill. 

LINGUISTIC SHORT-TERM MEMORY SKILL MAY PRESAGE READING SUCCESS 

Having made a link between reading skill and effective use of phonetic 
representation in linguistic shoVt-term memory tasks, and having reviewed some 
of the evidence as to why poor readers may have difficulty with phonetic 
representation, I will now concentrate on some ramifications of this difficul- 
ty. According to the view introduced in the beginning sections of this paper, 
phonetic representation is crucially involved in all normal language process- 
ing. Since spoken language antedates written language, and insofar as 
phonetic representation is involved in spoken language processng, difficulty 
with phonetic representation should often be found as an antecedent of reading 
failure. 

A study completed only a short time ago speaks to this point, revealing 
that those kindergarten-aged children who make less effective use of phonetic 
representation in a word-string recall task are likely to become the poorer 
readers of their first-grade classrooms (Mann & Liberman, in press). The 
-subjects for that study were a population of kindergarteners whom we followed 
longitudinally for one year. During May of the kindergarten year we assessed 
their memory for spoken strings of phonetically confusable and nonconfusabl e 
words, their memory for nonlinguistic material (the Corsi block sequences), 
and their awareness of the syllabic structure of spoken words. The following 
year, as first graders, these same children again received all of the memory 
tests, and a standard reading test. At this time they were rated by their 
teachers as "good," "average," or "poor" in reading ability. 

The findings for the two years of the study are given in Table 3* Note 
first that the children in the three reading groups had equivalent IQ scjres; 
we found no correlation between IQ scores and our measures of reading 
achievement. The children in the three groups also performed, equivalently on 
the Corsi test of nonlinguistic memory; neither their kindergarten nor their 
first grade scores on this test were correlated with our reading measure. In 
contrast, however, both of our linguistic measures proved able to distinguish 
between children in the three different reading groups. Elsewhere we have 
discussed, the relation between success at learning to read and the ability to 
realize the syllabic structure of spoken words (see, for example, Liberman & 
Mann, in press; or Mann & Liberman, in press). Here I will focus on the 
relation between effective use of phonetic coding and reading skill. It can 
be seen in Table 3 that children in the three reading groups* were strongly and 
significantly differentiated oy their performance on the phonetically non- 
confusable word strings. As first graders, children's performance on this 
type of word string was significantly correlated with their reading ability— • 
more importantly , a significant correlation also existed between their kinder- 
garten performance on the phonetically nonconfusable word strings, and their 
first-grade reading ability. Note further that both as kindergarteners and as 
first graders, the poorer readers tended not only to perform at the lower 
levels on the word string memory test, but also to be among those least 
af fee tec by the manipulation of phonetic confusabiiity . Thus, their ineffec- 
tive use of phonetic representation not only associated with their difficulty 
in learring to read, but actually presaged xt. 165 
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TABLE 3 



Mean error scores of good, average and poor readers on memory tasKs 
A longitudinal study (IQ determiaed in kindergarten, reading 
achievement in first grade) 



READING 
ABILITY 

GRADE 
LEVEL 


VERBAL-MEMORY 

Max =32 


NONVERBAL MEMORY 

Max =32 


SYLLABLE SEGMENTATION 
TASK 


Nonrhyming 
Word Strings 


Rhyming 
Word Strings 


Corsi Blocks 


(Percent passed in Kdgn.) 




GOOD 
N = 26 

IQ 114.7 


READERS 
KDGN 
1st GRADE 


8.1 
5.5 


13.4 
12.1 


8.4 
8.7 


85% 




WERAG 
N = 19 
IQ 114.7 


5E READERS 
KDGN 
1st GRADE 


12.8 
9.2 


15.4 
11.3 


9.0 
8.1 


56% 




POOR 
N = 17 
IQ 115.5 


READERS 
KDGN 
1st GRADE 


13.2 
13.7 


15.0 
12.7 


10.1 
10.1 


24% 



READING SKILL, LINGUISTIC SHORT-TERM MEMORY AND ORAL COMPREHENSION 

The finding that effective use of phonetic representation can be a 
precursor of reading success is consistent with the view that reading skill 
derives from language skill, given the position that effective language 
comprehension is linked to effective phonetic representation, and the presump- 
tion that successful comprehension is essential to learning to read well. 
Clearly, one final demonstration is called for. If poor readers tend to make 
less effective use of phonetic representation than good readers, and conse- 
quently encounter difficulty retaining the words of sentences, then we may be 
able to demonstrate that they are less able to comprehend spoken sentences, 
especially if comprehension demand, reliance on an effective short-term memory 
store. 

Together with some of my colleagues (Donald Shankweiler and Suzanne 
Smith) I am currently analyzing the results of a study that asks whether there 
exists a three-way link between reading skill, effective use of phonetic 
representation, and spoken language comprehension. Clearly the existence of 
such an association would further support the view that reading skill is a 
product of language skill. The subjects of this most recent study are good 
and poor readers from a third grade population that J.s Homogeneous with 
respect to age ?id IQ. They have been given a test of memory for strings of 
phonetically nonconfusable words, and several different tests of oral language 
comprehension, including two tests of our own design and one standardized 
clinical test. Thus far, we have only completed our analysis of the results 
of the standardized tests, a test called the Token Test (DeRenzi & Vignolo, 
1962). 

In the Token Test, subjects receive a series of oral instructions that 
specify how they are to manipulate a set of small colored "tokens." It has 
enjoyed considerable success as a reliable indicator of disorders of oral 
comprehension both among patients with acquired language deficits (DeRenzi & 
•Vignolo, 1962) and children With developmental language disorders (LaPointe, 
1976). We chose to use it because it forces reliance on the grammatical 
structure of a sentence rather than on common-sense knowledge or extralinguis- 
tic caes, and also because it poses an obvious stress on short-term memory. 

The test itself consists of five basic parts that are graded in 
complexity. For the first four parts, all of the instructions are simple 
imperative sentences that contain a constant verb and either one or two noun- 
phrase objects. The instructions systematically increase from part to part in 
the number of objects involved and in the adjectival content (one or two 
adjectives) of the noun phrase. For the fifth part, the instructions contain 
as many words or more than those in the third and fourth parts, but further 
contain a series of different verbs and different noun phrase structures in 
the predicate. Thus the first four parts of the test involve a systematic 
increase in the number of objects and attributes that the subject must 
remember, whereas the fifth involves not only a substantial memory load out 
also an increase in syntactic complexity. 
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In general, the results* of our study of Token Test performance have 
revealed that ^oor readers tend to do less well than good readers. In 
particular, we find that they do' as well as good readers on the first three 
parts of the test, but fall behind on the last two parts. We had anticipated 
that the fourth and fifth parts might pose relatively more difficulty for the 
poor readers, simply because they contain the longest instructions. However, 
we recognize that -difficulty on .the fifth part of the test could also be a 
consequence of a more specific difficulty with, recovering syntactic structure, 
aside from a short-term mentory deficiency. Thus, while we have indeed 
established a relation between reading ability and oral comprehension of 
sentences, it remains to be determined whether ineffective use of phonetic 
representation can account for this relaticA* in any direct way. We have some 
indication that for the children whom we tested, performance on the Token Test 
was at least moderately correlatad with word-strirfg memory performance. It 
also appears possible that for both the good and poor readers, the errors made 
on part five may have been direct consequences of the memory demands posed by 
certain instructions. We hope to continue to gain more insight into this 
issue as we analyze the results of our other two comprehension tests. 

As we pursue this and other research, my colleagues and I are entertain- 
ing several possible outcomes. On the one hand, ineffeotive phonetic repre- 
sentation could not only compromise ongoing sentence processing, but also 
limit the development of linguistic competence. It- is also within the realm 
of possibility that poor readers possess a comprehension deficit that is not 
so much a consequence as a concomitant of difficulty with phonetic representa- 
tion. Perhaps reading disability, ineffective phonetic representation, and 
comprehension deficiencies are all manifestations' of some more general 
language impairment that we have only begun to characterize. Surely the 
characterization of that impairment will be a productive research objective, 
since it may both illuminate our understanding of the psychology of reading, 
and clarify our approach to the current epidemic o£ reading failure. 
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ON THE ROLE OF SIGN ORDER AND MORPHOLOGICAL STRUCTURE IN MEMORY FOR AMERICAN 
SIGN LANGUAGE SENTENCES 



Vicki L. Hanson and Ursula Bellugi-* 



Abstract . Sentence processing in a visual/gestural language was 
investigated by testing signers' recognition for American Sign .. 
Language (ASL) sentences. Using a continuous recognition paradigm, 
sign order and structural changes that altered the meaning of a 
sentence were noticed at both immediate and delayed (45 seconds) 
test intervals. Sign order and structural changes that resulted in 
a paraphrase of an earlier-occurring sentence were noticed only with 
immediate testing. These results indicate that signers decompose a 
complex sign into its lexical and inflectional components during 
sentence comprehension and remember the meaning expressed by these 
components rather than remembering the exact sign structure. 

In the past decade, it ha3 become clear that there are primary gestural 
systems, passed down from one generation of deaf people to the next, that have 
taken their own course of development as autonomous languages. American Sign 
Language (ASL) is the common form of communication used by deaf native signers 
among themselves across the United States and parts of Canada. 'ASL is a 
primary visual-gestural system, not based on, nor derived from, any form of 
English, having its own lexicon and grammar (Klima & Bellugi, 1979). Because 
of the radical difference in the transmission medium for signed languages as 
opposed to spoken languages, ASL affords an opportunity to examine a question 
not easily investigated in other ways: namely, to what extent is language 
processing shaped by the production modality? 

Research on American Sign Language shows that this visual-gestural system 
exhibits formal structuring at the' same two levels as spoken languages:, a 
sublexical level of structure internal to the sign (the phonological level in 
spoken languages) and a level of structure that specifies the ways signs a<-e 
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bound together into sentences (the grammatical level). ASL thus shares 
underlying principles of^ organization with spoken languages, but the formal 
devices that appear arise from the very different possibilities afforded by 
the visual-gestural modality. In spoken languages, the structuring of lexical 
items and morphological processes is essentially sequential. Words are 
composed of sequentially produced sounds ; morphological pT ocesses commonly 
involve affixation of phonemic segments sequentially ordered in the sound 
stream. In ASL, however, signs are composed of contrasting fo. mational 
parameters, co-occurring throughout the sign, and the morphological processes 
of ASL involve embedding the sign .stem in superimposed contours of movements. 
Sign • nd inflectional marker thus co-occur in time (Bellugi, 1980). 

In relation to the structure of the signs themselves , it has been 
determined that signs are not just holistic and iconic, but rather are 
composta of a limited number of arbitrary formational components that combine 
in regular and constrained ways in the signs of the language (Battison, T 97^ ; 
Stokoe, Casterxine, & Croneberg, 1965). These parameters (handshape, loca- 
tion, and movement) have been found not only to be formal linguistic 
descriptions .^f signs, but .so to be psychologically real in the sense that 
signers rely on these parameters in sign processing. In studies of short-term 
memory for .*igns, signers characteristically make errors based on these 
formational parameters (Bellugi, Klima, & Siple, 1975; Frumkin & Anisfeld, 
1977 X. Similarly, "slips of the hand 11 involve exchanges of these components 
(Newkirk, Klima, Pedersen, & Bellugi, 1980). 

ASL differs dramatically from spoken languages in the form of its 
morphological processes. As a visual-gestural language, its morphological 
devices embed sign stems in superimposed changes of space and movement. 
Figure 1, for example, shows the sign PREACH under a variety of morphological 
operations [e.g., PREACH (basic sign), f preach to them, ? 'preach to each of 
them, 1 1 preach regularly, 1 etc.].1 

The wide variety of semantic distinctions that are obligatorily marked 
morphologically in ASL sentences are often indicated in English either 
lexically or phrasally. ASL verb signs, for example, undergo obligatory 
inflections for referential indexing , indicating subject and/or object of the 
verb; for reciprocity ; for grammatical number , marking distinctions such as 
dual and multiple; for temporal aspect , indicating distinctions such as 
♦regularly, 1 'over and over again, 1 'for a long time, 1 'gradually 1 ; for 
distributional aspect 'to ecch, 1 'to any,' 'all over, 1 'to certain ones. 1 
There are also a large number of derivational processes such as those that, 
derive nouns from verbs, that derive predicates from nouns, and that signal 
figurative or extended meanings. The elaborate system of formal inflectional 
dev-ices, their widespread use to vary the form of signs, and the variety of 
fine distinctions they systematically convey suggest that ASL, like Latin and 
Navajo, is one of the inflective languages of the world (Klima & Bellugi, 
1979)^ 

The present research examines for the first time whether these formal 
linguistic descriptions of ASL morphological structure correspond to psycho- 
logical representations that signers use in the interpretation and retention 

of sentences. 
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Several experiments with English have been conducted to test whether 
linguistic descriptions of the language can be applied to describe the way in 
which readers/listeners process sentences. These experiments, taken as a 
whole, provide evidence that while descriptions of grammatical structure are 
psychologically real in the sense that they are used in sentence comprehen- 
sion, these structures are not used as a basis for sentence representation in 
memory. 

In a seminal study, Sachs (1967) presented data strongly supporting the 
hypothesis that after the meaning of a sentence is comprehended, the exact 
wording and the syntactic structure are forgotten. Sachs had passages of text 
fead to normally-hearing college students and, after various intervals of 
interpolated-material, tested the students' recognition memory for sentences 
of the text.' The test sentences could be different from an earlier-occun ing 
sentence in one of the following three ways: (1) different by a semantic 
change; (2) different by an active/ passive change; or (3) different by a 
formal change (e.g., "He called up Mary" vs. "He called Mary up"). For 
immediate testing, performance was quite accurate for all sentence change 
conditions. But after the presentation of as little as 27 sec. of interpolat- 
ed material following the target sentence, only the semantic changes were 
noticed. These results have been replicated in later studies using Doth 
spoken and printed sentences and text with«adults (Anderson, 1971; Begg, 1971; 
3egg & Wickelgren, W; Brewer, 1975; Fillenbaum, 1966; Johnson-Laird & 
Stevenson, 1970; Sachs, 1971: Wanner, 197D and with children (Trembath, 
1972). 

An experiment by Bransford and Franks (1971 ) clearly demonstrates the 
tendency of subjects to recall the meaning rather than the form of sentences. 
Subjects in that study were presented semantically-related study sentences 
that contained one, two, or three propositions. In a recognition test, 
subjects tended to falsely recognize four-proposition sentences that they had 
never seen before, but which were semanticaily consistent with the integration 
of propositions across the study sentences. 

The present research is concerned with whether signers similarly abstract 
the meaning of ASL sentences and then retain this meaning independent of the 
sentence structure. At . one level this experiment tests retention of sign 
order. The work with English readers/listeners has indicated that there is 
little retention of the exact word order of sentences for anything but 
immediate testing. It is similarly expected here that signers will not retain 
information about the sign order of ASL sentences in long-term memory. Work 
with English sentences has also shewn that following sentence comprehension, 
readers'listeners do not retain information about the exact lexical composi- 
tion of sentences. The morphological processes of ASL afford the opportunity 
to provide a more stringent test than is possible with English of the tendency 
of language users to renember the meaning of sentences independent of the 
lexical composition. The morphological processes of ASL, by superimposing 
movement patterns on basic lexical signs, strikingly alter the dynamic visual 
form of the sign. Of interest here is whether signers will remember the 
global form of a sign or whether they will decompose the complex sign into its 
lexical and inflectional components and remember only the meaning expressed by 
these components. 
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UNINFLECTED 




PREACH (b«aic) 



R£F£R£NTIAL INDEXING 




PtlKCB lR ^ ipr0ca11 PREACH tDu * 11 PREACH l * lAtip1 * 1 

'pruach to ««ch oth*r' 'prtach to both* 'prtach to oth«rV 



Figure 1. The uninflected sign PREACH and its form under a variety 
inflections. 
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TEMPORAL ASPECT 




PWACa lItabitluX1 
•praach regularly' 




PUACH lI " ratlvtI 

•'praach over and orex again* 




'preach tor a long tin*' 



DISTRIBUTIONAL ASPECT 




•preach to aach of thaa' 




pMACH {Apportion -stive Internal) 
•praach all pver* 




(Apportion* tive Extarnal] 
'praach anong meraoers of a 
group' 




pKEMCH tAllocatlva De£arminata] 
'praach to car tain onaa' 




pR]EACH lAllocativa. Indeterminate) 
'praach to any and all' 



Figure 1 , continued , 
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The paradigm to be used is one adapted from work on memory for sentences 
in both written and .spoken English (Begg, 1971; Begg & Wickelgren, 197*4) . In 
these paradigms, subjects are presented with several sentences. Af 4 aer various 
intervals, a test sentence is presented. Subjects are asked if- that sentence 
is identical to an earlier-occurring one. In the present experiment, a test 
sentence can be changed from the target in one of four ways: Two of these . 
changes ( Formal and Lexical changes) will be paraphrases of the target 
sentence, preserving meaning. The other two changes ( Inflection and Semantic 
changes) will have different meanings than the target. 

METHO D 

Stimulus Materials and Design „ 

Stimuli were ASL sentences. The sentences were independent of each other 
in terms of grammatical structure and content. They were not related in a 
story context. ^There were three different kinds of sentences: Original, 
test, and filler sentences. Original sentences were the first presentation of 
an experimental sentence. Test sentences were the second presentation of an 
experimental sentence . 

Fifty experimental sentence pairs were used in five testing conditions. 
In one condition,' the original and test sentence were identical. Eighteen 
pairs of experimental sentences were included in this condition. The other 32 
sentence pairs' were equally divided among four conditions, in which the test 
sentence was changed from the original sentence. There were ^eight .sentence 
pairs in each of the following four change conditions: 

a) Formal Formal changes involved a change in sign order with no 
resultant change in meaning. An example of a Formal change is given below. 
Both the ASL gloss and the English translation are presented. 



Original: DOCTOR NO IX:' to me 1 ] EAT CHEESE MILK ETC. THAT** 
English 

translation: The doctor told me not to eat dairy products. 

(1) Test: CHEESE MILK ETC. THAT DOCTOR NO [X: ! to me 1 ] EAT 
English 

translation: Dairy products are the food the doctor told me not to 
eat . 

b) Le xical : Lexical changes involved pairs of sentences that had the 
same meaning expressed in two different ways., Thus in one sentence a 
particular meaning was carried by two lexical signs while in the other the 
sairie meaning was conveyed by a single inflected sign. An example of such a 
sentence pair is the* following: 



176 



Original: SUPPOSE WINTER (MY) FRIEND [iDs'chum'] S-U-R-E OFTEN SICK 
PITY [Xr'him 1 ]. 

' * English 

translation: If it's winter my chum will surely get sick often, poor 
thing. 



(2) Test: SUPPOSE WINTER (MY) FRIEND [iDj'chum'] S-U-R-E SICK , r 
[M:Frequentative] PITY [X^him']. 

English 

translation: If it's winter my chum will surely get sick often, poor 
thing. 

The difference in form between the phrase OFTEN SICK and the modulated 
SICK[M: Frequentative], both of which have the same meaning, is shown in 
Figure 2. 

> * 

c) Inflection : Inflection changes involved pairs of sentences that were 
.different in meaning. The original sentence included inflected signs; in 

the tes't sentence, the inflections were transposed, but the order of the 
sign stems was maintained, resulting in a different meaning. The following 
sentence pair is an example in which the inflections on GIVE and INFORM are 
exchanged. Figure' 3 illustrates this change. 

Original: MARY , JANE PAPER GIVELN : Reciprocal] , BUT NEVER 
INFORM[I:Multiple] THEIR WORK. 

translation 1 : Mary and Jane give each other papers, but never inform., 
others about their work. 

(3) Test: MARY, JANE PAPER GIVElN : Multiple] BUT NEVER 

INFOlRMtltReciprocal] THEIR WORK. 

transla'tiS: Mary and Jane give others papers, but never inform each 
other about their work. 

d) S emantic : For Semantic changes, the meaning of the test and original 
sentences was altered by a change in sign order in which two signs 
exchanged positions, often across clauses. An example of such a Semantic 
change is given here, where MATH and MUSIC are interchanged. 

Original: MY WIFE TEACH [M :Continuative] MATH ALL-MORNING, 
AFTERNOONS TEACH [M:Habitual] MUSIC. 

English . , . , , 

• translation: My w*fe works hard teaching math all morning, and in the 
afternoons she generally teaches music. 

(4) Test: MY WIFE TEACH [M:Continuative] MUSIC ALL-MORNING, 

AFTERNOONS TEACH [M:Habitual] MATH. 

English . . ■ 

translation: My wife works hard teaching music all morning, and in the 
afternoons she generally teaches math. 
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OFTEN SICK (Basic) 



i 




gjCH (M: Frequentative] 
■eaning 'often sick 1 



Figure 2. An example of a Lexical change. In the top panel, the meaning 
'often sick 1 is expressed by two lexical signs: OFTEN and SICK. 
In the lower panel, the same meaning is expressed by the inflected 

l78 sign SICK[M:Frequentative] . 
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GIVE [R: Reciprocal) 
meaning 'give each other 1 




INFORM [N l Mu 1 1 iple] 
weaning 1 inform others 1 




GIVE[N:Multiple] 
meaning 'give others 1 



INFORM [R : Reciprocal] 
meaning 'inform each other 1 



Figure 3. An example of an Inflection change. In the top panel, the signs 
GIVE and INFORM are inflected with the reciprocal and multiple 
inflections respectively. In the bottom panel, the same sign.s 
occur but with transposed inflections, resulting in different 
meaning. 



179 



ERJC 



185 



There were thus five different experimental conditions as defined by the 
different relationships between original and test sentence pairs. In addi- 
tion, there were two different time intervals tested:* an immediate test and a 
45 second delayed test. Sentences in the five experimental conditions were 
tested half the time at each interval. 

Stimulus Tape 

Stimulus sentences were signed by a native signer of ASL and were 
recorded on videotape for use during testing. Natural facial expression was 
rnoluded in all sentences. The hands of the signer were restored to a neutral 
position between sentences to indicate , the end of one sentence and the 
beginning of another. The first three sentences presented were filler 
sentences. 

Test sentences were indicated by a star that appeared in the upper left- 
hand corner of the screen at the onset of each such sentence. Following each 
test sentence, a blank interval lasting approximately five seconds was 
included to be used as a response interval. 

For the immediate test, the test sentence followed the original sentence. 
For the delayed test, four sentences always intervened between original and 
test ^sentences. These four intervening sentences included original and test 
sentences as well as filler sentences. In many cases, there was also one 
response interval between original and test sentences. This difference in 
events between original and test sentences was caused by the variance in 
length of sentences used in the experiment. The number of intervening 
sentences was held constant, however, and the time of the delay interval was 
held constant at 45 seconds. 

Instructions, signed in ASL, were recorded on the beginning of the 
videotape. 

Procedure 

Subjects were instructed that they would see several ASL sentences and 
that they were to pay careful attention to each. At various times test 
sentences would be presented and would be indicated by a star in the upper 
left-hand corner of the screen. For each, subjects were todecide if the test 
sentence was exactly the same as a sentence that had been presented previous- 
ly. They were to circle YES on their answer sheet if the test sentence was 
the same as an earlier-occurring sentence and to circle NO if the test 
sentence was not exactly the same as an earlier one. Subjects were instructed 
that "exactly the same" meant the same signs and same sign order as well as 
the same meaning. ^ 

In addition, subjects were asked to make a confidence judgment about each 
sentence. Ihey were to circle whether they were "VERY SURE," "SORT OF SURE," 
or "GUE3SING" about their decision as to whether the original and test 
sentences were the same. 

*The stimulus sentences were preceded by a practice session that included 
four practice test . sentences. All sentences included in the practice phase 
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were simple sentences designed to illustrate clearly the nature of the 
procedure and to indicate that the structure as well as the meaning of the 
sentences would be important in the experiment. During this practice, 
subjects' answers were checked after each response. If a subject had answered 
incorrectly, the original and test sentences were shown to the subject again. 

Subjects 

Subjects were ten deaf volunteers recruited through the Center on 
Deafness at California State University, Northridge. Nine of the subjects had 
deaf parents and had learned ASL as a first language. The other person had 
grown up signing and was considered by native signers to be a skilled ASL 
user. There were five women and five men, mean- age 24.2 years. 



RESULTS 

■ ' » 

The percentage of trials on which the subjects responded that the test 
sentence was "different" from the original sentence is given in Table 1. For 



Table 1 



Mean percentage responses in which subjects responded that the test sentence 
was different from the original sentence. Also shown is the difference in 
such responses at the two time intervals. (A negative numbc-r as the 
difference indicates that there Was more tendency to respond that the test 
sentence was different for delayed than for immediate testing.) 



Formal 

Lexical 

Inflection 

Semantic 

Identical 

(«p<.05) 



Immediate Delayed Difference 

test test 



97.5* 59.2% 38. 3% » 

92. 5% 67.5% 25.0% « 

90.0% 81.7% 8.3% 

90.0% 95.0% -5.0% 

28.4% 37.3% -8.9% 
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Figure 4. Percentage of "different" responses greater than "chance" for the 
four change conditions. 
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all but the Identical sentences this percentage indicates percentage correct 
responses. For Identical sentences there was no difference in subjects' 
accuracy at the two response intervals, t(9) = 1.'<8, £>.05. 

The percentage of "different" responses for the Identical sentences may 
be taken as an index of subjects' bias to respond that a test sentence is not 
the same as the original. Following Sachs, "chance" is therefore defined here 
as the percentage of "different" responses for Identical sentences. The 
results are graphed in Figure H as the percentage of "different" responses 
greater than "fiance." 

Analyzing the percentage of "different" responses for the four change 
conditions, with immediate testing it was found that subjects were equally 
likely to respond that the test sentence was different from the original for 
all four sentence types, F(3. 27)=1 . 00, MS e=1 25.00, £>.05. Subjtots were 
therefore able to notice all four types of sentence changes equally well with 
immediate testing. They did not, however, notice the different types of 
sentence changes' equally well with delayed testing. An analysis of variance 
on the four types of change conditions by intervals indicated main effects of 
both condition, F(3.27)*1. 22, ^=195.30, p<.025, and interval, F( 1 , 9)=16. 35, 
M§_e-339-56. p<.01, that were qualified by an interaction of interval by 
condition, Ff3, 27) = 10. 20, MS e= i76.02, p<.001. Tnus , while performance was 
generally better with the immediate test than with the delayed te'„, the 
degree to which the time interval adversely affected performance was dependent 
upon the condition being tested. Results of a Tukey (had) post hoc analysis 
indicated that there were significant differences between the immediate and 
delayed testing only for Formal and Lexical changes (those that preserved ■ 
meaning) (p<.05). Inflection and Semantic changes, both of which changed 
meaning, were noticed as well with the delayed as with the immediate test 
(p>.05). 

A one-way analysis of variance was performed "on accuracy in the five 
delayed test conditions. The main effect of condition was significant, 
F(H, 36)=19. *»9. MSe=2U8.47. ■ P<.01. Post hoc analyses were undertaken to 
determine the bills for thif effect. Results of these tests indicated that 
the percentage of -''different" responses was greater than "chance", for all four 
change conditions (Dunnett's t statistic, p<.05). Additional analyses indi- 
cated a distinction between meaning-preserving and meaning-changing sentence 
oairs- For both the Inflection and Semantic changes, subjects were more 
likely to respond that the test sentence was "different" than they were 0 
respond "different" for the Formal and Lexical changes (Newman-Keuls , p<.0b). 
Thire was no difference in the percentage of "different" responses between 
Semantic and Inflection changes (Newman-Keuls, £>.05) nor between Formal and 
Lexical changes (Newman-Keuls, £ >.05). These results indicate that while 
subjects responded more accurately than .would be expected by chance for all 
four types of sentence changes, their ability to notice the sentence changes 
was dependent on the type of change: changes of meaning (Inflectional and 
Semantic changes) were noticed more consistently than were meaning-preserving 
changes (Formal and Lexical changes). 

To obtain a score for the confidence rating, subjects' resppnses were 
assigned .the following numerical, values: VERY SURE=3. SORT 'OF SURE*. 
GUESSINGrl. If the subject responded that the test and original sentences 
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were identical, their confidence rating was multiplied by -*L If the subject 
responded that the test and original sentences were not the 3ame, their 
response was given a score equal to the confidence rating. Subjects' scores 
on this confidence rating were then analyzed . Mean confidence ratings are 
shown in Table 2. 



Table 2 



Mean confidence ratings at both immediate and delayed testing. A positive 
nirctrer indicates a tendency to respond that the original and test sentences 
were. different . A negative number indicates a tendency to respond that the 
original and test sentences were identical. The scoring procedure is 
explained in the text. 

Immed iate Delayed Di f f erence 

test test 



Formal 2.88 .56 2.32 * 

Lexical 2.57 .97 1 . 60 • 

t Inflection 2.30 1.77 .53 

Semantic ■ 2. MO 2.67 -.27 

Identical ; -1.2*1 -.73 . -.51 

\»p<.05) 



A t-test was performed" on subjects 1 , confidence in responding that the 
- Identical test sentenc&s we^e the same as an earlier-occurring sentence at the 
two response intervals. • The nonsignificant results, t(9)=1.69. £>.05 f indi- 
cated that for Identical sentences there was no. difference in subjects 1 
confidence of their responses at the' two time intervals. 

Scores for immediate test in the four change conditions showed no 
difference in .confidence for the different conditions, F(3. 27)=1 • 71 • MSe=.372, 
p>.05. Scores were then subjected to an analysis of variance on condition by 
interval* Results indicated main effects of both condition, F(3. 27)=5. 25. 
2§e*-.533. P<.01, and interval, Fd\ 9)=25 . 02, MS e =.87i, P<.01, that were 
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qualified ,by an interaction of the two variables, F( 3 • 27 ) = 1 1 . 16 , MS e =.587« 
£<.001 t Again, a post hoc Tukey (bsd) analysis indicated the difference in 
performance at the two* intervals was apparent only for the Formal and the 
Lexical changes (jg<.C5>. This pattern of results reflected the fact that with 
delayed testing, subjects were not confident of responding that the test 
sentences were different from the original sentences for the formal and 
Lexical changes althougn they were confident of responding that the test 
sentences were different from the original sentences for the Inflection and 
Semantic changes. 

In this experrment, therefore, subjects wer.e both accurate and confident 
in noticing Inflection and Semantic changes even with 45 seconds intervening 
between original and test sentences. In contrast, sentence changes that 
preserved meaning (the Formal and Lexical 'changes) were accurately and 
confiaently noticed only with immediate testing. 



DISCUSSION 

This experiment represents one of* the fir t attempts to study ASL 
sentence processing. It was found that signers use syntactic structure to 
comprehend sentences but represent the meaning of ASL sentences in long-term * 
memory independently of the sign order and the holistic sign structure. 

As in studies with English, paraphrases were not noticed well with 
anything but immediate testing. .This does not mean that only sentence meaning 
was retained. -Sachs (1967) also noted this fact, stating that subjects "did 
have some ability to recognize the form of the sentence but that it was quite 
low and contrasted greatly with their memory for the semantic content of the 
sentence" (p. 441). The present finding that all four types of sentence 
changes are recognized better than "chance" is consistent with this conclu- 
sion. In particular, research has shown that when subjects know in advance 
that their memory, for sentences is being te^ ed (.as in the case of the present 
study), additional information about the surface form of sentences is retained 
(Anderson, 1974; Begg & Wickelgren, 1974; ' Johnson-Laird-^ Stevenson, 1970). 
This is similar' to work showing that people can remember .such "trivia" as the 
typography of words when reading (Kolers & Ostry,^ 1<}74). Thus, it is 
apparently the case that people can remember many types of information about 
the sentences they process: However, there is strong evidence that the 
primary information remembered about sentences is the semantic interpretation 
for written and spoken sentences and, as shown here, for signed ASL sentences. 

One type of paraphrase in the p\resen f , study involved a change in word 
order. These Formal chjanges were not noticed with delayed testing even 
though, as shown in sentence pair (fl ) , the sign order changes generally 
involved a topicalization.change. Sentences in ASL often follow what has been 
referred to as a topic-cdmment structure. This means that the topic of the \ 
sentence, marked by a Specific facial expression, occurs _£irst, and is/ 
followed by <a comment on 'that topic. For example, in the test sentence of 
sentence pair (1), the tjopic is "dairy products." The comment is that "ti^fe 
doctor told me not to eat, [them]." In the criginalsentence for that pair, the 
topic of the sentence was , that "the doctor told me not to„.(do something)" and -* 
the comment explains that the prohibited activity is eating dairy products. 
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^ Sentence topicalization in ASL is marked by an eyebrow raise and an upward 
. head tilt (Liddell, 1977) ♦ The beginning of the comment is signaled by a 
relaxation of head and eyebrow position, A topicalization change was present 
for three of the four test sentences at each delay. The fact is that subjects 
did not notice even these Formal- changes. 

Semantic changes in this experiment, as well as Formal changes, were 
caused by changes in sign order. But when the sign order change also caused a 
meaning change, the change was noticed.; Thus, it is not the sign order, per 
se , that was remembered , but rather it was the semantic content that was 
remembered . j 

One important, aspect of the present results is that signers were shown to 
/ be representing /the meaning of a signed sentence independently of its 

morphological composition . This was shown strikingly by the results of the 
Lexical change condition. Subjects remembered the meaning of the sentence but 
after a brief; dplay did not remember the form in which the meaning w.ts 
conveyed. Thusb for example, subjects' remembered the meaning ! frequerM> 
sick 1 but did. not remember if the actual sentAce contained a two-sign phrase 
" OFTEN SICK or b single inflected sign, SICK[M:^equentative] . 

This finding agrees nicely with work by Poizner, Newkirk, Bellugi, ^6 
iU.ima (1981) on short-term memory for lists of inflected signs. In that 
study, subjects saw snort lists of inflected signs and wrre asked to recall 
" " the list by signing the items immediately after each list. Becall errors 
reyealed that subjects recalled the base sign and its inflection independent- 
ly. Thus, signers decomposed the sign into its meaning components and did not 
retain the oexact form of the sign. 

Notice, however, a striking difference in this work on sentence memory 
and the results from the short-term memory paradigm of Poizner et al\ (1981). 
For the "lists of inflected signs in their study —subjects confused which 
inflections were superimposed on which basic signs. But tMs was not true 
when the inflected signs were put in the meaningful sentence context of the 

i_ present experiment. The high recognition accuracy for Inflection changes, 

even at longer intervals, indicates that people did not make this confusion. 
Results from the Lexical change condition further show that this lack of 
confusion was not a result of signers remembering the exact form of the sign 
presented. Rather, this accurate performance in the Inflection ciiange condi- 
tion' can be attributed to the fact that switching t> * inflections changed the 
meaning of the t£st sentence.' Thus, signers wer able to\ reject the test 
sentences for Inflection changes, not uecause they noticed that the sign forms 
were changed but because they noticed that the meanings of the test sentences 
were changed. 

In recent years a picture has b^en emerging in which basic cognitive 
processes constrain the structure of signed and spoken languages so that 
underlying structure fbr languages inHxth modalities exhibit many similari- 
ties (Bellugi, 1980). The present work on sentence memory suggests that in 
signed language as in spoken langi&ges, the meaning; of a sentence is 
abstracted and the structural mechanism by which this meaning is conveyed is 
not retained in the long-term memory representation of the sentence. 
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FOOTNOTE 

1 Words in capital letters represent English glosses for AJSL signs. The 
gloss represents' the meaning of the basic uninflected form of a sign. A 
bracketed symbol following a sign gloss indicates the grammatical process the 
sign has undergone (X: for referential indexing; M: foF modulation for tempo- 
ral aspect or focus; N: for numerosity inflection; D: for derivational 
process"; iD: for. idiomatic derivative). The symbol may be followed by a 
specification of the inflectional process or by the meaning of the inflected 
fonp. For example, GIVEEN Exhaustive] and GIVEEN^to each 1 ] are alternative 
ways of representing the same inflectional process. The solid bar above 
specific parts of a sentence indicates sentence topicalization . 
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FSRCEPElCm OF "NASAL CONSONANTS WITH SPECIAL REFERENCE TO CATALAN 
Daniel Recasens+ 

• 1. INTRODUCTION 



Iii this paper I study the role that different place cues play in the 
recognition of nasal stops. I claim that their perceptual relevance is 
strongly dependent on how they are related at the articulatory and acoustic 
levels and, essentially, on tfce nature of the process of speech perpeption 
itself. I 'show that this is the case by investigating experimentally 
interactive perceptual effects between transitions and murmurs in the recogni- 
tion of final unreleased alveolar [n], palatal and velafr [7]- after ^a] in 
Catalan, using synthetic speech stimuli. 1 Special emphasis is given to the 
cues for the palatal nasal. 

I proceed first to investigate what acoustic properties of the signal can 
be shown to convey -place information by -looking at a large amount of 
production and perceptual data on nasal murmurs and formant transitions. The 
role of releases in the process of place identification for nasals is also 
taken into account. 'A consideration of other cues besides formant transitions 
seems highly advisable. In 'an early perceptual experiment with synthetic 
speech (Liberman, Delattre, Cooper, & Gerstman, 1954) it was found that-, in 
contrast with initial non- nasal stops, final nasal consonants ([m], L n J> L^-l) 
' after different vowels were properly identified only 55# of the time for 
stimuli wrth appropriate transition endpoints and a cross-category fixed nasal 
murmur. Results from more receht : experiments, both with synthetic (Garcia, 
1966, 1967a, 1967b; Hecker, 1962; House, 1957; Nakata, 1959) and with natural 
(Henderson, Note ,f; Mal^cot, 1956; Nord, 1976) speech stimuli have shown that 
not only formant ' transitions but also murmurs and releases are cues to .place 
of articulation for nasal consonants. As will* be shown, experimental data 
from the literature oil speech perception suggest that all these cues for 
nasals ought to be considered as interdependent and, therefore-, need to be 
taken into account in models of the perceptual evaluation of place cues. 
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One must also consider the place cues for []*>] within the overall nasal 
set* It will be pointed out in what way their being taken into 1 consideration 
at the analysis and synthesis levels affects theoretical considerations about 
perceptual relevance of transition patterns advanced by other scholars in 
earlier experiments that did not account for [JH. 

The allophonic system of Catalan nasals in absolute final position is 
adequate to test these hypotheses with synthetic speech experiments. It 
consists of unreleased [m] t Ln], [jO t L*)j and .allows analysis of the 
perceptual effects of transitions vs. murmurs with special reference to place 
cues for []*]. In my experimental paradigm, which differs from approaches 
£aken previously by other investigators, complete ^patterns of synthetic 
transitions and murmurs are directly .based on real speech utterances and 
combined reciprocally in , perceptual continua for all the different place 
categories. Perceptual * results are related to production data on nasals 
collected from Catalan speaker^ and speakers of other, languages, and discussed 
in the light of the literature. Evidehce^for a complementary perceptual 
influence of transitions and murmurs consistent with parallel effects observ- 
able for Sbth cues at the acoustic level is reported. This and other, findings 
argue for^ some form of motor theory (Liberman, Cooler/ Shankweiler, & 
Studdert-Kennedy, 1967) that refers to the unitary articulatory gesture to 
account for the perceptual processing , of dynamic acoustic cues in syllables 
ending with nasal stops. An integration model similar to that of Dorman, 
Studdert-Kennedy, and Raphael (1977) for non-nasal stops is proposed to 
account- for the perception of nasals after [a] as well as other v vocalic 
nuclei. 

Data for nasal consonants in syllable-final position are taken intc 
consideration because of the fact that the perceptual effectiveness of murmurs 
in place recognition in this position is known to be considerably higher than 
in syllable-initial position (Malecot, 1956; Nord, 1976). .Open vowels La], 
[*] have been chosen for analysis and discussion since the perception ^of 
< consonantal nasalization improves with [a] vs. ti], tu] (Ali, Gallagher, 
Goldstein, & Daniloff, 1971; Martony, 1964; Zee, ,1981)/ Also [?], which 
happens to be harder to identify, in general, than other; nasal consonants 
(Garcia, J966, 1967a, b; Malecot, 1956; Ohala, 1975), can be recognized with 
La] quite easily when synthesized (Hecker, 1962) or presented in natural 
speech (Wang & Fillmore, 1961 ), but rather poorly when it happens to be 
contiguQus to [i], tuh In section 4, I will also refer to the interactive 
effects of nasal cues and other' vowel nuclei. 



2. CUES FOR NASAL C ONSONANTS: PERCEPTUAL RELEVANCE^ 
ARTICULATORY AND ACOUSTIC CHARACTERISTICS 

2.1. Manner Cues • 

Certain well-defined spectral characteristics of .nasal murmurs mark nasal 
consonants as a class, independent of place of articulation and the adjacent 
nasalized vowels (Delattre, 1958, 1968; Fant, 1960; Fujimura, 1962; Fujimura & 
Lindqvist, 1970; Hattori, Yamamoto* & Fujimura, 1958; Mattingly, 1968). 
Formant transitions, on the other hand, are essentially place markers; in 
'fact, as shown below, only the first formant transition contributes effective- 
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ly to manner identification. In, the following paragraphs I refer to those 
spectral characteristics,- fcheir geheral articulatory correlates and perceptual 
relevance, in preparation for discussing not only differences in murmur 
patterns for nasals of different place of articulation but also those 
experimental paradigms concerned with place identification that make use of 
^ftxed or slightly variable murmurs: 

a. First formant (Nl), at around 250-300 Hz, with higher intensity than 
-the upper spectral regions, dependent on a large internal cavity size (pharynx 

and nasal subsystems) behind .the tongue constriction because of nasal cou- 
pling. According to Delattre (1958), the < intensity level of N1 and the other 
spectral regions of the murmur is around 6 dB (N1 ) and 15 dB (N2 K N3, N4. .*) 
lower than fqr a K normal non-nasalized vowel. It seems tc be the most 
important class* cue for nasal consonants, in contrast with the negligible 
perceptual role of the frequencies^ of higher njasal formants (Delattre, 1968). 

b. * Presence of an antiformant (NZ) , varying in frequency* with place, 
according to the. size of the mouth cavity behind the tongue .constriction, 
which acts as a shunt. It 'seems to convey mainly place information. 

c. Concentration of fonnants (N2, N3, H4. ..) between 300-4000 Hz, with 
large bandwidth (BW) values, mainly due to the 3arge surface ^rea of the nasal 
cavities and the dissipative energy losses originated within them. The small 
perceptual 'significance of those formants seems to result not only from their 
low intensity level with respect to N1 (especially N2,. often absent as 
reported by Fant, 1962, and Weinstein, McCandless,* Mondshein, & Zue, 1975) but 
also from their spectral variability. * In synthetic speech experiments, the 
following ranges of fixed frequency values have been found effective in 
realizing the nasal murmur when place is held constant: N2: * around 1000-1150 
Hz; N3: around 2000-2500 Hz (Delattre, 1954; Libennan et al., 1954; Massone, 
1980, for Argentinian Spanish; Miller & Eimas, 1977). 

The value for N2 has been proved to be dependent on the size of the 
narrow velar passage . to the* nasal cavity (Bjuggren & Fant, 1964). The 
significance of an N3 around a "typical" 2200 Hz area has also been pointed 
out. by Fant (1 962)*; this resonance seems to be chiefly dependent on. the. 
characteristics of the pharyn* cavity (Fant, 1960; Fujimura, 1962). In line 
with these observatidns, De Mori, Gubrynowicz, and Laface (1979) have recently 
proposed the automatic interpretation of any frequency concentration between 
2000-2800 Hz as N3 and of the first nasal formant below it as N2 as a speech 
recognition rule for identification of nasal consonants. 

There are also available data on optimal formant bandwidth values for 
consonantal nasality. Thus Mrfrtony (1964) ha* stressed the perceptual rele- 
vance of an N2 bandwidth value around 250 Hz. given an optimal N1 value at 
around 100-150 Hz. Such an N2 bandwidth is close tc frequencies chosen as the 
most favorable for nasal perception by Nakata (1 959) (N2: 200 Hz; H1: 300 
Hz) 0 and Pickett (1 965) (N1. f N2, N3: 180 Hz; N4; 300 Hz).. 

d. Overall lower intensity level than vowels. House (1 957) assigned to 
the murmur an intensity 8 dB lower than that appropriate for [i]. 
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Other manner cues besides nasal murmurs need to be accounted for: 

* -a*. Vcwel nasalization, taken into consideration in 'experiments with 
svnthetic apeecji for a 100 msec (Haskins Laboratories QPR 1?, 1954, 'Appendix 
2) and 20 msec (Miller & Eimas, 1977) overlapping period between vowel and 
consonant. It can be simulated by replacing F1 by two formants (N1, F1 ) and 
an antifonnant (NZ) and by increasing gradually 'and monotouically N1 and NZ 
values from 0 (absence of oronasal coupling) to 600 Hz for N1 and 650 Hz for 
NZ (Fujimura & Lindqvist, 1970, Figure 13) or 660 Hz for N1 and 700 Hz for NZ 
(Fant, I960) (presence of a small degree of oron^asal coupling). It would be 
desirably to reproduce the effect of higher extra pole-zero concentrations of 
the nasalized vowel and to reach a better tinder standing of the configuration 
of frequency continuities and discontinuities between murmur and vowel for- 
mants at the closure onset in order^to find out to what extent , they help to 
identify consonantal nasality. A continuity between vowel formants' and nasal 
poles (F1-N1, F2-N3, P3-H4. ..) (Fant, 1960) is confirmed by my considerations 
on analysis and synthesis of nasals (section 3*2.). Moreover, Takeuchi, 
Kasuya, and Kido ( 1 97 5 ) have shown that an uninterrupted pole excursion 
between vowel and nasal consonant, with the addition of some nasality 
parameter'* that represents the amount of spectral difference between nasal and 
vowel spectra, can be regarded as a valid cue for detection of nasals as a 
class. 

b. Nasalized releases following the nasal murmur, different from non- 
n^teal stop releases in • presenting low- frequency masking (Blunstein & Stevens, 
1%9). 

c. F1 transitions, generally negative but less so than for non-na^l 
stop3. This differential acoustic cue has perceptual relevance for nasals'as 
a class (Fant, 1967; Mattingly, 1968; Miller 4 Eimas, 1977). 

2.2. Place Cues 



A. Nasal murmur . 

A 

It has been suggested that 'nasal murmurs also play a relevant role in 
identificaticm of different * places of articulation. Thus, experimenters at 
Haskxns Laboratories ^emphasized very early the. polyvalent natur~e of formant 
transitions and nasal murmur characteristics iif the process of place discrimi- 
nation among nasal consonants (Cooper, Delattre, Liberman, Borst/& Gerstman, 
1952). As Fant (1980) has proposed recently: "The base rule stating that 
stationary segments signal the manner and transitions the place of articula- 
tion has more exceptions than one might expect, yhus the nasal murmur of [m], 
[n], and l] may contain strong place cues..." (p. 14). In the light of such 
observations, I will argue for consistent place cues in the nasal murmur 
portion by comparing relevant production and perceptual data. 

Table 1 presents mean and extreme frequency values for N1 , N2, N3, N4 and 
NZ in production data from male speakers 'of different languages (Czech, 
German, English, Hungarian, Polish, Russian, and Swedish)' reported by several 
authors. A summary of the results, included at the bottom part of Table 1, 
shows that formant patterns for h*] and [9] are very similar except in the 
case, of N4, which is' higher for t^J than for [j*]; [m] and. [n] present lower 
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Tablt 1 



PoXt-i.ro analyii. froquoncy v«lu*. (in Ht) corraapondii* to tha 
nutl auraura of-ULUl. W. M * Caach C9). OtM (9). 
tofli.h (4), Hungarian (8), Polish (1, 5, 6, 7), Ruaaian (2, 9) 
and Stidiah 3) (Si apaakara) . A a«nary of valuaa ia alao includad. 



[-] 



300-340 

250 < 

100-350 

280-290 

100-400 

300 

300 



H2 

850-900 
800 

875-1300 

950-980 

800-1200 

750 

850 

850 

600-800 



14 IZ Hafarancaa 

(1 )Ihtfdawici (1967) 

(2) Fint (I960) 

(3) Iant (1973) 
UJftrfiaura (1962) 

(5) Jaaaa« (1964) 

(6) Jaaaas (1973) • < 

(7) Kaoproaaki (1963) 
(e)lb«dica (1969) 
(9)K«iportl (1973) 



1150-1500 1 700-2000 _ 
1000 2000 550 

2000-2500 _ < j 

1360 1950 750-1 250/ 
2150-2500 2800-3000 _ 
1300 2000 800 
1300 ' 1850 1100 

2000-2600 _ 

1300-1900 1800-2900 _ 



310-430- 
250 

100-300 
300 

100-400 

330 

320 



900-1000 1500-1800 1900-2350 0 ) 

800 —1200 • 2000 T5bo -UX 

1000-1750 2400-2900 „ (3) 

1000 1450 TGOQ T?5O-2700 (4) 

1700-2000 2150-2500 2800-3000 _ (5) 

1200 1750 " 2050 1450 (6) 

900 1500 205Q 1750 (7) 

1 000-1 600 2050 _ _ ■ W 

830 1 900 2200 _ (9) 



[j J 310-430 
250 

100-400 

345 

340 



1000*1200 1500-1600 2000-2300 _ 
800 1900 2600 2200 

1100 2200-2500 2800-3000 _ 
1225 2100 2750 2750 

1000 "1650 2180 2400 

1500-2000 _ _ _ 

900-1100 2200-2500 2850 _ 



0) 
(2> 
(5) 
(6) 

81' 

(9) 



] 320-440 
300 

100-500 

250-400 

100-400 

350 

350 



900-1300 
1000 

1000-1 250 
950-1150 
900-1100 
1100 ' 
1050 

750-1500 
800 



1500-2200 2000-2550 (1 * 

2200 2900 (2) 

1625-2400 3200-4000 (3) 

1700-2200 2300-3000 aFova 3000 (4) 

2200-2500 2800-3000 _ (5) 

1900 " 2750 abova 3800 (6) 

1800 „ 2450 - _ (7), 

2300-2600 (8) 

2000 I56o (9) 







S wastry 




[a] 100-400 
285 


600-1300 
• 795 


1000-2600 1700-3000 550-1250 (Extrta.es) 
1610 2090 860 (Maan) 




[nl 100-430 
300 


600-2000 
1115 


; 1 ' "■ <j 

1200-2900 1900-3000 1450-2700 (Extraaas) 
1790 2190 1770 (Maan) 




[Iw] 100-430 
J 320 


800-2000 

mo 


1500-2500 2000-3000 2200-2400 (Extraaas) 
1920 2450 2450 (Maan) 




[*) 100-500 
330 


750-1500 
1060 


1500-2600 2000-4000 abova 3000- 

3800 (Ixtrsats) 
2015 2650 abovo 3400 (Maan) 
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formant values, those for [m] being even lower than those for tnh Data on N1 
fre'quency values present a succession [J]>[j0>[n]>[m3 presumably related to 
cross-category differences in size of the coupling section at the velar 
pharyngeal passage" and pharyngQnasa^tract size; although the differences- in 
those frequency values are small t should be pointed out that they can be 
"found in data reported by some researchers in Table 1.2 Data on NZ frequency 
values give an arrangement [9]>£j^]>[n]>[m] that is consistent with" I$Z depen- 
dence on the oral cavity sizeV behind the tongue constriction. (Also for 
Japanese [m] vs. [n], see Hattori et al., 1958). The summary al'so shows 
frequency proximity of NZ and some specific formant, depending on place 
category: Cm] (N2) f tn] (M3), [Ji (N*0 f [•}] (higher than NM) (Fujimura, 
1963). A way to look at the distinctive role of NZ is by considering its 
position with respect to two adjacent fr)rmants (a pole-zero cluster) at 
different frequency regions according to na^al categories of different place 
•of articulation (Fujimura, 1963). According to data in Table 1, when values 
reported by different investigators are accounted for, such a classification 
procedure can be said to hold strictly for - [n] (N3 "between N4) and [Jl (above 
tHe general low formant structure, even above N6, according to Kacprowski & 
Mikiel, 1965), but not for [m], whose antiresonance can lie between N1 and N2 
or N2 and Nj, nor [J0 f whose antiresonance can lie between N3 and N4 or N4 and 
N5. This variability seems to be partly related to differenced in vGWel 
context.. 

Information about perceptual relevance of released vs. unreleased murmurs 
as place cues is provided by experiments in -which they were presented for 
identification alone or directly attached to the vocalic portion with no 
'formant transitions or release. I refer to results obtained for open vowels 
[a], [*] with labial, alveolar, palatal,^ and velar nasal murmurs in final 
position. 3 * ' * * *• 

In experiments with natural English speech, released murmur segments for 
[m] were Categorized quite accurately (80%-100%) whether ' presented in isola- 
tion or following [*] without formant transitions (Malecot, 1956). Henderson 
(Note 1) found for .the o.^rall vocalic "set a higher accuracy in pLace 
identification for [m] without transition or release (about 75%) than for [n], 
[J] "(65%— 75% ) ; following [a], in the absence of formant' transitions, the Efai]- 
murmur identification was always higher than 90%, independ§ntl*y of the 
-presence or absence of release. Identification scores for synthetic tm] 
murmurs in isolation-* with American English subjects are reasonably consistent 
with results with natural speech (65%-85%) (House, 1957; Nakata, 1959) . 
Moreover , Manrique de, Gur lekian, and Massone (1 980) report in experiments 
with natural Argentinian Spanish speech that not only isolated [m] murmurs but 
also isolated [n] murmurs give' a higher percentage of [m] than [n] identifica- 
tions. % 

v The syllable [an], with no transitions or release, is identified 50%-60% 
of the time (Henderson, Note 1); with release, the average rises to 97% in 
Henderson's experiment but not in Malecot's (50%). Both naturally-spoken 
(Malecoo, 1956) ^and synthetic (House, 1957; Nakata, 1959) tn] murmurs present- 
ed in isolation give the same 50%-60% effect. [n] murmurs not heard as [n] 
tend to be heard as [ml, as in' Manrique >de et al. (1980). 

Dukiewicz (1967) has shown that [Jvl murmurs presented in isolation to 
Polish speakers elicit no ty] judgments. Correct recognitiory of ^ improves 

30% when murmur is presented with its corresponding onset. 

* - 
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Isolated [9] murmurs are less well identified than those „f or [n] and {m] 
(Malecot, 1956: 12*; House, 1957: 62%: Nakata, 1959:- 41J8)j they tend to be 
interpreted mainly as [m] but also as [nj. .According to Maleeot, released 
mui^aur without transitions is very poorly identified (34?) and confused mainly 
with [*n]. Henderson also finds J murmur to be a poor indicator with vowel 
' [6] 1 (45l); however, for [atj ] ^without transitions, responses rise to 74* 
'(unreleased rfurraur) and 32% (released riurmur) . " 

• ^ Frequency valueb chosen in synthetic speech experiments vSnglish, *Ita4i-*' 
an, Polish) with fixed and variable* murmur patterns are another useful 
indirect source *pf reference in the investigation of murmur structures for 
nasals of differed place of articulation '(Table 2)\ While' experiments with 
fixed patterns "account exclusively for class properties of •cpnsonantal nasali- 
ty (section , 2.1. c.) , 'these variable pattern^ capture NT and ; NZ * iyn antics 
correctly., to simulate different place properties ( but reproduce .only -- to som-e 
degree the complex formant structure observed for different nasal consonants 
(gee Table t). 'A good approximation to NZ can -be obtained by replacing it, 
together with the two surrounding poles, by a single foraant having an 
appropriate * bandwidth. This method -may 'explain good perceptual results 
obtained by Nakata (1959) and Kacprowski and Mikiel (1,965) with initial 

-.- synthesis - vaflrues~-l-isted in- Table-2-. : * , 

A comparison of the- production* and perception data given above allows us 
to. -infer presumable place cues for nasal consonants. Thus, the remarkable 
importance of released or unreleased [m] murmur in place identification can be 

• related to the particular 'low N1 and NZ frequency value3 within an overall low 
X murmur spec^umr Such spectral c6nf iguratipn has been reported to differ 

consistently and strikingly from that of the other members of the nasal set 
(Table 1; Malcfcot, 1956; Romportl, 1973)- The perceptual importance of NZ for 
- [m] vs. [n] has been reported ]bjr De Mori et'al. (1979) and that of a strong 
" low spectrum component about 1000-1500 Hz in /the ckse of [m\ vs. a high 
concentration about 2300 Hz for [n] f -[^] tyf Delattre (1 958). 

Intraspeaker, and interspeaker inconsistency for murmurs other thafr [m] 
has been noted in different languages (Malecot, 1956; Delattre, 1958; Rom- 
portl, 1973). As already stated, "for [a] and [*], while [>J .murmur seems to 

• carry very little place infomation, [j] murmur and [n] murmur provide 
important place information- in thi unreleased case and are identified quite 
consistently' when released. perceptual distinction between ^L^J vs. L nj 
could be 'well cued byh}gh vsl low N1 and absence vs. presence of NZ at the 

'central region of the spectrum. Accordingly, a higher N1 value for L^J 1240 
Hz) than for [m], £n] (180 Hz) was reported to help perceptual place 
identification (Haskins Laboratories QPR 11, 1954)- NZ's being *ab6ve 3000 -Hz 
or absent in the case of [9] is consistent with Ohala's (1 975) * observation 
that its perceptual effectiveness is presumably severely attenuated because of 
that high frequency location. High, perceptual distinctiveness and, at the 
same time, similarity of murmur spectra (high N1 and NZ, similar N2-N3-N4 
configuration) between [9'] an<? accord well with the strong role, of LJ^J 

transitions in [y] identification (section 2.2.B.)* 

With respect to acoustic aspects other than spectral frequency, it has 
been found that N1 , N3 bandwidths for h] are greater than those for Lm], I nj" 
, and that [n] presents a very high degree of N2 damping with respect to LmJ 
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Table ^ 



Pole-zero frequency values (in'Hz) corresponding to the nasal ^ 
murmurs of [m], [n], [y] f fyL assigned to synthetic speech' 
stimuli in experiments with English (l , 2, 3)*f Italian .(5) • t 
. and Polish (4)* A summary of values is also included, \ 



[A] 



N1 


N2 


N3 


N4 


HZ** 


250 


•1360 . 


2200 




1550 


180 


1000 


2250 




. 750 


•200 


1100 


2500 






200 


1100 


2500 






.300 ,, 


1100 


2000 • 








,*4 









References 

(1 )Hecker (1962) 
(2,)House (1957)* 
(3)Nakat&(l959) 
(4 )Kacprowski 

4 Mikiel ^1 965) 
(5)?errero et 

al. .(1977) 



[n] 250 
' 240 
200 
- 200 
300 1 


. 1300 
• • 1 1 00 
1"700 
17Q0 
. 1800 


• .^0 2000 
2300 
2500 

. * 2800 
2800 




4O00 ' 
3500 


' (1> 
(2) - 
»(3) 
(4) 
(5) 




[]«.] 200 

400 


. 2300 
2300 


2800 
3000' 




S 


(4) . 

(5) - 




[ n ] 33o 

' 200 
260 


1300 
.2300 
900 


2200' 
2500 
^•2300 




1 

libove . 


(D 
(2) 
-'(3) 


5 



5000 





Summary (Extremes) 




[■] . 1 '80-3.00 


. 1000-1360 2000-2500 . 


750-1 550 


[n] 200-300 


1 1 00-1 800 2000-2800 


"3500-4000 


[y>] 200-4.00 


2300 2800-3000 




[ p ] 200-330 


900-2300 2200,-2500 

• 


* above 5000 «■ 
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; (Fujinura, 1962). High N1 damping ffcr t?] has also bS^fc reported by o*h$r 

• investigators (Fant, 1973; -Kacpra'nski., 1965). No evidence, about the^arceptu- 
al significance, of these differences'"^ available. The perceptual influence 

* of,T5oartiiul9tiion'eC£eats *due ko the , ad'jacent J) vowels upon place identification 
of a\jfasatf consonant has- been shown in experiments 'on' automatic speech 

, xecognition, (De Mori et al;, 1979); such influence is to be" expected fr.om 

* " significant, variations of the pole-zero structure of thc-Kasal murmur accord- 
ing ''to the vocalic environment.! -Moreover, the perceptual importance of 
relative amplitude levels at'^different spectral areas shoul/'Tje investigated. 

* In this respect it is highly plausible that, according to -data .reported by Ee1< 
' (1972-), energy minima at, typical NZ frequency zones (Section 2..'2.A:) convey 

relevant perceptual information in distinguishing nasal categories of differ- 
*ent place of articulation. * , 



B. Formant TrapsitiJns 



In Table 3 I present analysis frequency values for" F2-F3 transition 
endpoints as .well as positive (+), steady «( = ) and' .negative (-) transition 
ranges according to- data on syUables with [a] ancf Cm], ln3, Lj-J, T?J reported 
by several authors. 1 * The four languages that have have been chosen are. ... '7 

Hungarian, Ital'ian, Polish, and Russian. r . 

According to the summary presented at the bottom of Table 3, F2 for tj»0 
does not overlap in frequency with other categories and shows a constant » . 
.(positive) transition direction,. In fact, a'-250 Hz separation minimum between y 
F2 values for [n] and [jtf -is a good indication of high F2 distinctiveness for , 
M\ Such- a difference w?s also found for, Czech and- Russian by RortporU 
(1973) (Cn]:' 1100*1300 Hz; ' ly-h ' 1600-1800 Hz) . * , *, 

On 'the other hand, F2 transition values for lp overlap significantly 
with those for- other nasals, namely [m] and Cn], being in the central part, of 
the' overall -r'arige of endpoint frequency values, for the consonantal set. \ 
Variability of F2 transition values with [a] from .slightly rising to steady 
and slightly falling has also been noted for the velar rion-nasal correlates 
[r] [kl (Fischer-Jdrgensen, 1954; Halle, Hughes, & Radley, 1957, Potter, 
Kopp, & GVeen, 1947). See, however, Dalby & Port, W, who found quite 
strongly positive .F2 ranges)-, Flat transitions were found for English [<^J by ^ 
Green (1959). ' ' - \ 

Predictions of acoustic theory of speech production support F2-F*3 values 
reported in Table 3. Thus, a comparison with one of Fanfs nomograms U9bu, 
.p. 84). shows that, while given frequencies for labial and alveolar nasals 
match well with constriction .points' located near the lip opening * re m %'°™ant 
frequencies for [93 correspond to a constriction place at about 9 cm from lip - 
opening and 'those for [•p-U-.to a • constriction at about 4 cm. N^ive 
■ transitions for Imll are due to a complete., labial constriction. On the- other 
hand a more forward constriction . point for Cn], [J* than for [9] causes F2 
and F3" values for [a] to- increase as- a result of a decrease in front cavity- 
size Higher frequency for palatals than alveolars is presumably related to a 
greater increase in .the conductivity index of the internal resonator neck: 
palatograph^ evidence for Iji in different languages often shows alveolar- 
contact, as for- LnJ, plus several degrees of prepalatal and/or dorsopalatal 

contact. % 
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TablaJ 

Analysis fraquancy valuta (la Ma) corraapondlnf to F2-F3 
traaaltlona In ayllablta with [**] and [■], [n], [y], [9] 
in Hungarian (3), "Italian (5, 6), Follah (1 , 2) and Ruaaian (4). 
A axaanary/of valuta la alao inc^udad. 





Bndpointa 




Ranft a 




Direction 


Rafarancta 








* 


F3 








w 


1 000- 

1100" 


2850- 
3050 


-50/ 
-250 


♦50/ 

♦too 


♦ 


(1 )Dukit¥ics (1967) 












-/- 




(2)Jaaaaa (19S2.19& 




1050 


T695 


-380 


^500 






(3)«Mdica (1969) 




800 


2150 


-500 


r50 % 






(4)*ant (1960) 




1200 


2340^ 


-200 


-HO 






.(5)Vagiaa at 

aU (1978) 


[»] 


1250- 


2800- 


♦50/. , 


-150/ 


♦ 


-/♦ 


M 




1500 * . 


2900 


♦200 


>200 






(2) 




TT50 ^ 


T?50 


*?Z0 


^540 


♦ 




(3) 




1400- t 


'2250 " 


0 


-50 


• 




,(4) 




1500 












(5) 




1425 


266$ 


, *55 


♦40 








1450- ' 


2650 


' ^50 


-50 


-A- 




(6)FarVaro at 




1500, ' 












_ ai. (1979) 



t>] 1900- 
2300 

' 1^70 
o 2650 

2140 

2200 


2500- 
3150 

2032 . 

2850 

2990 * 
3200 


♦500/ ' 
♦1090 

"^495 
♦550/ 
♦650 
♦825 
♦650 • 


^450/ 
♦550 

T95 ~ 
♦550 

♦475 
♦350 


♦/-/- 

♦ ♦ 

♦ - 

♦ ♦ 

♦ ♦ 


.0) 

(2) 

(3) 
(4) 

(5)' 

■(6) " • 


[)] 900- 
' '1400 


2900- 
, ' 3200 

j* 


-50/ 
, -200 


-100/ 
♦60Q 


- -/♦ 
-/♦ 

-/- » _ 


(1) . • 

(2) , 
(3) 






Sumr> 






* • 


[a] 800- 
1200 


1695- 
3050 


-50/ 
4500 


-500/ 
♦100 


-(•) -(♦) 




[n] 1250- 

1,650 


1650- 
^2900- 


0/ 
♦220 


-540/ 
♦200 


♦(■)■ -(♦> 


• 


If] 1900- 
2300 


2030- 
3200 


♦495/- 
^♦1000 


-450/ 
♦550 




-) - ' 


[nj 900- 

1£00 


2900- 
3200 


-50/ % 
-200 


-100/ 
♦600 


-(-, -(♦) 




r 1 
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Besides conveying- manner information, F1 transition values seem to 
contribute to place identification* Thus, while the ifl transition is 
extremely negative (Fant,-1960; Vagges, Ferrero, Caldognetto-Magno, & Lavagno- 
li t 1978), that for [9] is usually only slightly negative and can even be 
positive (Dukiewicz, 1967). For [a] followed by LjJ f the negativity is due to 
an important increase of back cavity size and a noticeable increase of vocal 
tract constriction (Delattre, 1951} Faht, 1960). The slightly negative F1 
excursion between [a] and [^] is related to a smaller increase in pharynx 
cavity size, - s - ^ 

* Available (lata on perceptual experiments with natural- speech and synthet- 
ic speech give information about the relevance of formant -^transitions in place 
recognition. According to Henderson (Note 1), sequences of 'fa] followed by a 
nasal consonant, without nasal murmur and with or without final release, give 
60* of [m] responses when presented with Cm] ^transitions and 80% of [n] 
"responses with [n] transitions but only 1551-30* of [J] . responses with [pi 
transitions in favor of a majority of Can] judgments. F.or LajJ, with natural 
Polish speech, Dukiewicz (1967) found that transitions compensated for the 
negligible place information conveyed by murmurs (section 2. 2. A.). 

In Table 4 I present transition frequency values reported in perceptual 
tests, with English-speaking .subjects for synthetic [m], [n], [J] and vowels 
[a], .[<]•• Unlike the murmurs, ' there is much available perceptual data on 
transition cues. A comparison of -values for [a] in Tables 3 and 4 shows that 
F2 frequency values categorized as [J] in synthetic speech experiments (1920- 
2300' Hz) correspond exactly to analysis values for F2 of [Jv] while analysis 
values for F2 of [0] correspond to vaJLues - categorized as Lm] and Ln] in 
experiments with synthetic speech. It also suggests that, in the absence of 
[>] as a labeling category, .the E2 difference of 250 Hz (1650-1900 Hz) between 
transition frequency values appropriate for [n] and [JO (Table 3) was 
interpreted exclusively as Ln] by English listeners (Table 4). On these; 
grounds, it seems clear that stimuli with DfO-like transitions were interpret- 
ed as [0] by English listeners but might well have been categorized as [J»Q by 
speakers of other languages, while stimuli with [^-like transitions, in the 
Absence cf an appropriate negative F3 and some [;]-like murmur spectrum, were 
interpreted as [m] or [nh This view is supported by observations reported in 
•the literature. Thus experimenters at Haskins Laboratories stated that while, 
with no F3, "a. large plus F2 transition is heard as lyi... rather than l?J, 
with a -3 transition* positive F2 transitions are now heard clearly as [?]" 
(Haskins Laboratories QPR 8, 1953, pp. 21-22), Direct evidence about the. 
effect of a strongly rising F2 in cueing palatals is also provided by Derkach, 
Fant, and Serpa-Leitao" (1970) for Russian palatalized consonants. Thus, they 
found this F2 transition type to be the most relevant perceptual palataliza- 
tion cue with vowel La]. Consistently with the contrast in formant transition 
patterns for [>] and [9], important improvements i" identification of the non- 
nasal stop LgJ without '<F3 transition (Liberman et al., 195*4) were found to be 
dependent* on the presence of an optimal - 480 F3 (Harris, Hoffman, Liberman, 
Delattre, 4 Cooper, 1958). In' studies on 'speech analysis listed in Table 3, 
the direction of F3 was found to be predominantly negative as well. 

In the light of previous comments, there are reasons, to .believe that 
degrees of F1 excursion should be included as variable parameters in perceptu- 
al continua of place identification for -nasals. Thus, -as shown on other 
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Table 4 



Frequency values (in Hz) correspond ing^ to F2-F3 transitions in 
syllables with [a], [«.] and [■], Ln], [7] obtained from synthesis 
experiments with English. A summary of values is also included. 



Endpoints 
F2 F3 



[m] ar.700 
(Locus) 
900- 
1450 
ar.1300 

1 000- 
1300 
920- 
1300 



2000- 
"2300 



Ranges 
F2 



-190/ 

+360 

-200 

-500 

-310/ 
+70 



- Direct ion 
F3 F2 

-A 



F3 



-200/ 
+1 00 



References 



(1 )Liberman et 

al. (1954) (M) 

(2) Nakata 
(1959)(W) 

(3) Hecker 
(1962)([«]) 

(4) Garcia (1966, 
1 967a, b) ([«]) 

(5) Miller &'Eimas 
(I977)([a]) 



[n] ar.1800 






+360 


+ 


+ 


(1) 


(Locus) 












(2) 


1450- 




+3/60/ 




+ 




1890 




+800 








(3) 


ar.180C 




+300 




+ 




1300- 


2400- 


-500/ 


T200/ 




+ 


(4) 


1-920 


3000 


+1 20 


+800 






(5) 


1300- 




+70/ 




+ 




1700 




+470 











[9] ar.3000 

(Locus) 

1920- 

2300 

-2000 

1920- 
23CO 



-360 



2200- 
2400 



+830/ 
+1 210 
-+200— 
+1 20/ 
+500 



0/ 

+200 



+ 
+ 



(1) 

(2) 

(3) 
(4) 





Summary 






[m] 900- 
1450 


-500/ 
+360 


-200/ -(+) 
+1 00 


-(♦) 


[n] 1 300- 
1890 


' -£00/ 
+800 


+200/ +(-) 
+800 


+ 


[•O] 1920- - 
2300 


+1 20/ 
+1 210 


-360/ + 
+200 


-<-,/♦) ' V 

> 



200 

• Hue 



grounds by Derkach et al. (1970), a strongly negative F1 transition is to be 
considered a relevant palatalization cue. The perceptual relevance o^ a 
higher F1 starting-ending point in identification of a non-nasal velar tg3 vs.- 
[dJ f CbJ has been pointed out by Fujimura (1971). 

Individual and overall duration of formant transitions can be shown to 
differ among place categories. It is significant that details about this 
aspect (longer transitions for ty], shorter transitions for [mJ and In,]) have 
been taken into consideration in works on speech synthesis (Hecker, 1962; 
Mattingly, 1968; Nakata, 1959). One must point out that such observations 
about relative transition duration for different categories mean little unless 
vocalic context is kept constant. In such circumstances, duration of F2 
transition was found to be a rather important cue for the identification of 
different Polish nasal-vowel sequences (Kacprowski 4 Mikiel, 1 965.) . 

C. Interaction of Cues 

In sections 2.2. A. and 2.2. B. I have taken into consideration how formant 
transitions and nasal murmurs contribute separately to the identification of 
nasal consonants of different place of articulation and what acoustic traits 
make transitions and murmurs perceptually relevant in auch an identification 
process. Results show that: 

(1) either [ml transitions or [m3 murmur are sufficient place identifi- 
cation cues for tm]. The strong perceptual role of tm] murmur # in [*] f Ca3 
environments is to be particu?arly emphasized (Carlson, Granstrom, 4 Pauli, 
1972; Henderson, Note 1; Malecot, 1956); 

(2) [n3 transitions are more powerful place cues than tn3 murmur for [n] 
identification; 

(3) [93 cues, transitions, murmur, and release, are needed for a 
satisfactory [?3 identification with vowels M t [*3 but not with' taJ. In 
this case [?3 murmur is a stronger cue than [^] transitions; 

(4) [>3 transitions but not tyl murmurs are sufficient place identifica- 
tion cues for [J*0. 

The arrangement of cues in running speech suggests that place cues for 
nasals ought to be explored interdependently instead of in an isolated way . 
Any attempt to detect them should focus primarily on the interactive role of 
transition^, murmurs, and releases. Results reported by experiments with 
natural and synthetic speech, already discussed, can 'be said to adduce some 
valuable (although indirect) -information to this issue. Such experiments 
presented murmurs in isolation for identification (Dukiewicz, 1967; House, 
195?; Malecot, 1956; Manrique de et al., 1980; Nakata, 1959) or murmurs or 
transitions combined with the vocalic steady state ^portion, with or without 
final release (Male'cot, 1956; Henderson, Note 1). But, none of such attempts 
succeeded in combining all cues reciprocally to detect possible cross-category 
effects. A more realistic approach is reported by MaKfcot (1956) who, in 
addition to th<> experiments mentioned- above, compared the cue value of 
transitions and released murmurs extracted from natural speech utterances with 
[*] and [m], [n3, tyll stimuli were tested with American English subjects. 
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According to Malecot's data, [m] cues (murmur and transitions) override 
transitions and murmurs f6r [n] and [*)] f and [n] cues (murmur and transitions) 
override transitions and murmur for [^]. These overriding m effects are 
significant in all cases except for [n] murmur upon [p] transition (52% to 
48%). They or eport valuable information about cross- category effects of 
transitions vs. murmurs and, therefore, are of -particular interest for my 
experiment on Catalan nasals. 

D. Summary 

It has been shown that place cues for nasals are complex. In order to 
characterize them satisfactorily, experiments with natural and synthetic 
speech need to be appropriately designed. Some relevant suggestions about 
this subject, to be taken into consideration in further research and partly 
accounted for in my perceptual experiment on Catalan nasal consonants, are 
presented in the following paragraphs. 

Synthetic stimuli have to recreate -the temporal arrangement of cues found 
in natural speech. Results about perceptual relevance of cues summarized in 
the preceding section are to be taken with care. Thus, except for those 
reported in Malecot'3 experiment on - interaction of cues, they -derive from 
experimental paradigms in which the arrangement of cues in the stimuli can be 
shown -to deviate from the arrangement of cues observed in the syllabic 
structure. This is not only the case for isolated murmurs but also for 
combinations of segments: abutting the remaining portions of the signal 
(Maldfcot, 1956; Henderson, Note 1) when murmurs or transitions have been 
removed clearly alters the timing relationship between all VC cues and, as 
Henderson herself has pointed out, might cause masking phenomena when transi- 
tions and final release are presented in succession; on the other hand, 
preserving the timing relationship between remaining cues would leave unnatur- 
al silent portions in the stimuli. 

Malecot's experiment on interaction of cues accounts for their temporal 
arrangement in the syllable but does not provide any evidence far place 
information conveyed by different transition characteristics and* murmur spec- 
tral regions* A speech synthesis paradigm Is needed for this purpose°*in which 
frequency values are close to - analysis data of veal speech. Moreover, 
acoustic traits of transitions and murmurs need to be varied simultaneously so 
that results t do .not refer independently* .to optimal transition* cues and optimal 
murmur £ues but to unitary transitiori- murmur 4 oues. In fact, -data reported in 
this paper reveal that murmurs and transitions for [m] , [n] , [y] , [9] are 
perceptually complementary similarly to bursts and transitions- for non-nasal 
stops'and, consequently, are integrated analogously in "the perceptual process 
with reference to well-stated production constraints. 

1 In the .following experiment, I try to put into practice these views on 
research strategies. I investigate the perceptual effects of acoustic cues in 
combination, using Catalan, subj ects, .by contrapoging transition and unreleascd 
murmur patterns for final nasal consonants of different place of articulation. 
In contrast with the experimental paradigms reviewed, the arrangement of cues 
within the syllabic structure is preserved while varying transition and murmur 
simultaneously and systematically. Although the perceptual relevance of all 
individual acoustic parameters is not tested, such a dynamic approach allows a 
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better understanding of their role within, the overall transition and murmur 
patterns as veil as the VC structure. Analysis data that have been* reviewed 
and theoretical issues that have been raised as well as additional information 
t obtained from Catalan speakers are taken into consideration. 



3. AN EXPERIMENTAL STUDY ON CATALAN NASAL CONSONANTS 

1 • Phonetic and Phonological Description 0 

In Eastern Catalan, phones [mj, [n], [j], [*>] appear in absolute final 
position. [m], [n], [j^J also^ appear intervocalically and correspond to 
underlying /m/, /n/, /jv/. [j] is found wort-internally^nl^_immediately 
before [kj and, in final position, occasionally in free variation Witlr [^kj. 
depending on the speaker and the lexical item. These phonetic facts argue for 
[pj being an allophone of underlying /n/ before a velar stop, generated by the 
following* phonological derivation (see also Mascarcf, 1978): 



"Underlying form" 
Regressive Assimilation 
Optional Deletion 
Devoicing 

Surface form 

'blood' 'bench' * 

The presence of underlying /g/ and /k/ accords well with the phonetic 
realization of derived formations such as [sapgi'nari] 'bloodthirsty', 
[s^gu'nos] 'bloody', [ b*>' ketaj* low bench'. Other minimal pairs with 
contrasting nasal stops in final position can be found. Thus, [ fam] 'hunger' 
[fan] '(they) do', [ f art] 'mud'; [bam] 'Aux. we go', ^[ ban] 'edict 1 , LbajvJ 
'bath' [bcu)J 'bench'. 

Pinal [m], [n], Lit], [p] are weakly released or unreleased according to 
- individual speakers. J Given the occurrence of the full set of nasals of 
• different place of articulation and the unreleased murmur condition, it is 
possible, then," in Eastern Catalan, to investigate %he role that transitions 
and unreleased murmurs play in the process of identification of different 
final place categories. 

m 

2. Production Study 

Analysis values corresponding to production data from * a single male 
Catalan speaker were chosen as reference points for the/patterns to be used in ^ 
' synthetic speech experiments. Several samples of monosyllabic minimal pair^.'~y? 
were analyzed by means of a Voice Identification sound spectrograph, a digita^P*^ 
spectrographic analyzer and a linear prediction model analysis. To see to 
what extent acoustic patterns found - in production data on nasals from this 
reference speaker were in accordance with those of other Catalan informants, 
data on the same minimal pairs embedded in a neutral sentence were also 
collected from 12 other.male Catalan speakers and analyzed spectrographic ally. 
Frequency values for the reference speaker as well as range of frequency 
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/sang/ 

(o) 

(k) 



/bank/ 
? 

(0) 



variation and mean frequency values for the other 12 speakers are presented in 
Tables 5 and 6. Both sets of data are jcompared below and discussed in the 
light of theoretical predictions and data from the literature. 

Nasal murmur values for the reference Catalan speaker (Table 5) are 
consistent with those given i*n Table 1. The structural configuration of poles 
and zeros in Table 1 is violated by N3 and N4 for [jO- t which, as for the other 
Catalan speakers, happen to be higher than that for [p].5 The continuity 
between F3 transition and N4 for [Jt] suggests that palatal, N4 is mouth cavity 
dependent. Large NV bandwidth values for [p] are consistent with data 
obtained by other investigators (section 2.2. A*). 6 As shown in the same table, 
the other Catalan speakers differ from the reference speaker in that they fail 
to show a contrast between Cn] and [p] with respect to N3 and N4. This fact 
suggests that, if murmur spectra are found to convey relevant ini^rmation in 
discriminating Cn] from [p] f perceptual distinctiveness is to be assigned to 
contrasting values for,N1 and NZ. It would be interesting to test the index 
^of perceptual confusability between Cn] and [p] in Catalan using real speech 
stimuli with tokens of unreleased [p]. \ 

For Catalan speakers (the reference speaker and 12 others) the general 
direction corresponding to F2-F3 transition mean values (Table 6) is consis- 
tent with that reported in the current literature on synthetic speech 
experiments for Cm], Cn], [p] in English (Table 4). Also cross-category F1 
values can be predicted on the basi3 of the acoustic theory \ of speech 
production. F2, not only for [jj but also for [n], is consistently «positive«*< 
(*ee fable 3 for comparison), even wher^ extremes of -th^ observed rajige of 
values are taken into consideration. While F2 values for Cm] and [Jt] fall 
/well apart from those for Cn] and [p], respectively, values for [p] overlap 
significantly with those for Cn] arid even Cm]. This is consistent with the 
fact that an appropriate F3 is needed to synthesize a satisfactory, unambigu- 
ous velar nasal consonant. 

The nasal murmur was at least 1.5 times longer than the preceding vocalic 
segment for the reference Catalan speaker. Transition durations for [jO were 
consistently longer (70 'msec average) than those for [n] (50 msec average) and 
[0] (35 msec). For this speaker, as well as for many other Catalan speakers, 
positive F2-F 3 excursions were still observable during the nasal murmur period 
as an effect of the dynamic motion exhibited by the large mass of tongue 1>ody 
towards the dorso palatal region. The perceptual relevance of the timing 
relationship between the gliding movement and t'.ie nasal closure onset has not 
been investigated in my perception experiment: nasal formants were kept 
steady instead, as found in the productions of some Catalan speakers. Murmur 
release and final voiceless stop after [93 were present or absent as predicted 
, in section 3.1. 

3. Perceptual Study 

A. Procedure 

• 

To explore the perceptual role of transitions and unreleased nasal 
murmurs in place recognition as well as to detect identification cues for Ifl, 
continua with Can], [a]0, Can] were synthesized in two parallel blocks of two 
slightly different tests each (1a, 1b; 2a, 2b) according to analysis values 
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Table 5 



Analysis frequency values ( ranges and means) (in Hz) for murmurs 
in VC syllables [.am], [an], [ajv], [ap] according to data from a 
single Catalan speaker and 12 other Catalan subjects. 



N1_ 


N2 


N3 


Mi 


NZ 


• Subjects 


[m] 200 


1120 


1360 


2100 




(1 )SingJ.e subject f 


170- 


910- 


1 120- 


1370- 




(2)12 subjects' 


320 


f 105 


1510 


1800 




(range values; 


255 * 


1015 


1300 


1565 


b . : 


(3)12 subjects ' , 










(•mean values) 


In] « 200- ' 


800- 


1460- 


1 950- 


1780 


■ d) 


300 


900 


1650 . 


2100 






BW:180 




BW-.330 


BW:235 




(2) 


225- 


880- 


1440- 


1775- 




325 


1135 


1640 


2600 




(3) 


285 


1035 


1515 


2130 




M 180- 


900- 


2000- 


2900- 


2360- 


(D - 


230 


1150 


2200 


3350 


3000 




BW:150 


BW:220 


BW:140 


BW:140 




(2) 


200- 


800- 


1365- 


1740- 




340 


1180 


2335 


3000 




' (3) 


' 295 


1055 


1760 


2265 




[n] 300- 


1150- 


1860- 


243CT- 


2900- 


(D 


' 400 


1240 


2200 


2650 


3400 




BW:275 


BW:200 


BW:300 


BW:250 




(2) 


225- 


90C 


1375- 


1 960- 




360 


1240 


1640 • 


2730 




(3) 


295 • 


1060 


1530 


2160 
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Table 6 



Analysis frequency values (ranges and means) (in Hs) for transitions 
" in VC syllables [mi], Un], [*J>], according to data frc* 
a single Catalan speaker andt12 other Catalan subjects 

Subjects 



[-] 



V steady-state 




Endpoints 




ZL 11 


F3 


F1_ 


tL 


F3 


920 ^1435 


2420 


. 800 


1150 


' 2420 


630- 1110- 


1940- 


525- 


980- 


1880 


$25 1580 


2250 


765 


1560 


2250 


?25 1 290 


2130 


655 


J215 


2075 



(1 )Single subject 

(2) 12 subjects 
(range values) 

(3) 12 subjects 
(■ean values) 



[n] 920 

570- 
805 
' 715 


1535 

1200- 

1510 

"1325 


2520 

1955- „ 

2610 

2300 


775 

525- 
760 % 
' 685 ■> * 


1540- 

1630 

t350- 

1660 

1560 


2600 

J 955-* 
2610 
2365 


,0) 

(2) W ' 
(3) 




[>] 920 

685- 

875 

755 


1535 

1200- 

1570 

1400 


2520 

1975- 

2700 

2315 


450- . 

550 

405- 

805 

635 


1840- 

1980 

1^25- 

2050 

1860 


2750- 

2850 

2025- 

2925 

2440 


(1) 
(2) 
(3)' 




[?] 920 

685- 

900 

770 


1535 

1285- 
'1680 
1440 


2520 

1995- 

2700 

2255 


. 900 

645- 

900 

740 


- 1400- 
1640 
1310- 
2025 
1550 


2125- 

2500 

I860- 

2475 
2140 


(1) 
(2) 
(3) 





Hangs s 



Direction 



[;] -! 



20 



-100/ 
♦10 



-1 35/ 
'♦105 
-75/ 
♦450 



-395/ 
-20- 
-315/ 
♦90 
-=65 — 



. . -/♦/- 
.(-) ♦(-> 



(1) 
■) (2) 

K 



4 



F1 

[a] -120 
-155/ 
-25 
-70 


P2 

-285 
-270/ 
♦30 
-75 


F3 

0 

-110/ 
0 

-55 


: :< :s 

0 


F3 
-(-) 


(1) 
(2) 

(3) 




[n] -145 

-90/ 
• 0 
-30 


-5/ 

♦95 

♦115/ 

♦300 

♦235 


♦80 

-190/ 

♦285 

♦65 


♦(-) 

-(-) 


♦ 

♦<-. 

♦ 


to 

-) (2)" 
(3) 




[V ] -470/ 
«/ -370 
-425/ 
♦50 
-120 


♦30t>/ 

♦445 

♦200/ 

♦630 

♦460 


♦230/ 
♦330 
-285/ 
♦435 
♦1 25 


-<♦. -) ♦ 

♦ 


♦ 

♦(-. 


(1) 
-) (2) 
(3) 
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for the Catalan reference speaker displayed in Tables £ and 6.- The! synthetic 
stimuli were prepared using a software serial forraant synthesizer (SYNTH) at 
Haskins Laboratories with variable BW parameters, ar\ extra pole (used as N2) 
and an extra zero (ased as" NZ) (Mattingly, Pollock, Levas, ' Scully J & Levitt, 
1981). 




wiirtJt; ixA^uy upgiumx v> a Qnoi %f cun _pc* v v c i uo • ah ^wuwa vti previous expei — 

mental studies, these two conditions allow us to determine identification 
frequency ranges across place categories for transitions and murmurs as well 
as to investigate more adequately the interaction of the two acoustic cues 
within a syllable structure recreated from that of natural speech utterances. 
Actual values are given in Figures 1 and 2. Poles, of tti# murmur pattern are 
represented by. single lines and zeros by double lines. Test 1a differs from 
1b and 2a from 2 b in vowel steady state and transition end point vafLues. Two 
versions' of the same experimental design were given simply tol test the 
perceptual effect of a larger variety of F2-F3. transition values and, 
therefore, to be able to. determine ^identification cross-over polints_with 
higher precision. I give some details about the variable transition ^endpoihts 
(Figure 1) and variable murmur structures (Figure 2), which will be tpken into 
consideration in the discussion section on perceptual data obtained from 
Catalan informants: 

a. ' F2 transition end points: 

Test la: From 1430 Hz (-70) 'to 1 920 Hz (+420) in 7 steps of 70 Hz; 
Test 1b: From 1600 Hz (0) to 2000 Hz (+400) in 8 steps of 50 Hz. 

b. F3 transition end points: 

Test 1a: From 2340 Hz (-160) to 2900 Hz (+400) in? aitfps of 80 Hz; 
Test 1b: From 2300 Hz X-300) to 3100 Hz ( + 500) in a/steps of 100 Hz. 

c. Nasal murmur structures: 

N1: From 250 Hz (6 steps) to 350 Hz in 5 steps of 20 Hz; 

N2: From 900 Hz to 1200 Hz in 1 1 steps of 30 Hz; 

113: From 1600 Hz to 2100 Hz (6 steps) in 5 steps of 100'tfz; 

N4: From 2500 Hz to 3000 Hz and vice versa in 5 steps of 100 Hz; 

^NZ: From 1800 Hz to 3200 Hz in 1 1 steps of 140 Hz; 



Formant cmnd width values were also varied across stimuli according to 
frequencies included in Figures 1 and 2. A constant value of 900 Hz for F1 
was chosen. - Preceding [aj was nasalized from vowel onset to transition 
end point by means ofa\progressive frequency rise of a single low pole (500 .to 
'6 00 Hz) - zero (500 to tQQ Hz) pair. Each stimulus was 560 msec long, having 
~a yowel steady state of 20<Xmsec, a transition ofHSOTmsec^ia^ 
of. 300 msec. There was a progressive 10 dB decrease i*rom vowel to murmur and 
an FO lowering slope between vo^l onset (120 Hz) and murmur offset (80 Hz). 




Frequency _ 
(in Hz) Transitions 



9000 



2600 



2200 



1800 ' 



1400 



1000 
600 



200 



Mb) 
(la) 



F3 




(lb) 
(la) 



F2 




260 



300 



200 



130 



Murmur* 



150 



150 



200 



190 



(p) 



260 



300 



200 



300 



Figure -I - Synthesis patterns for tests' la and lb (fixed murmur conditions) 
with inclusion of bandwidth values (in Hz) for murmur formants. 



214 



ERIC 



3000- 



2600 



2200" 



1800 - 



1400 



1000 



600 



200 



C2& 
(2a) 



(2b) 
(2*) 



F3 



F2 




Murmurs , 



170 160 170 1 



- — 100 
, — ^210 



— iod 



260 



230' 



210.230 



250 



— 160 180 210 240 270 300 

210 

270 

300 240 



200 200 200 ^ 200 200 200 jjoo 200^ 



1?0 142 164 188 178 180 212 234 266 ^SOO 



Flguro 2.-8ynthosls patterns' for tests 2a and 2b (I ixod transition condHion) 
wHh Inclusion of bandwidth vakios (In Hz) for murmur formants. 
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Every 'test was administered using several stimuli per s' <*p and the 
overall set of stimuli randomized before presentation for identification. 
Overall, test 1a was composed of 144 tokens (6 per step), test 1J> of 162 (6 
per step), test 2a of 165 {5 per step) and test* 2b of J 76 * (4 per step). 
Intervals of '4 sec were included between successive stimuli and longer 10 msec* 
intervals after every ten stimuli. Twenty- four paid Catalan subjects took 
each of the four tests twice " and. were asked to identify orthographic ally the 
ilnal nasal stop as [n], or They were all students who had" had no 

previous experience with synthetic speech and did not know the purpose of the 
experiment. Thirteen to'ok the tests binaural ly through headphones; the rest 
"listened to stimuli reproduced through' a* loudspeaker because of problems 
connected with taking the* tests in the " field. To find out -whether such 
^different listening ccmditions could have had some strong effect on subjects 1 
responses, I listened to stimuli under both conditions and obtained almost 
identical cross-over points and response distributions. 

B. Results 

Table 7 shows category judgments for each test in all variable 
conditions; data from all subjects and from the most consistent 14 label ers 
are reproduced separately. Results for ' each variable" condition have been 
displayed in Figures 3, 4, 5, and 6. Figures 3 and 4 give perceptual data 
obtained from tests Ja and 1b, and Figures 5 and 6 give data from tests 2a and 
2b. Figures 3 and 5 give data from all 24 Catalan subjects an} in Figures 4. 
and 6 data from the most consistent labelers (14 selected subjects). Each 
subplot in Figures 3 and 4 represents judgments for a particular murmur; 
stimulus numbers for different F2-F3 transition values lie on the abscissa. 
Each subplot in Figures 5 "and 6 represents judgments for a particular F2-F3 
transition set; stimulus numbers for different murmur values lie on the 
abscissa. Among all subjects those who identified the stimuli with most 
consistency were chosen as "best" labelers. A comparison of identification 
curves obtained from these 14 selected informants with those obtained from all 
24 shows that, as expected, the best labelers categorized ^ stimuli more 
^distinctively. Thus, the following summary on perceptual data about LnJ, Lj4i 
[p| identification will refer mainly to responses obtained from the most 
consistent Catalan subjects. 

a. Table 6. -A comparison of percentages of. category identification 
between murmurs and transitions shows that the labeling of appropriate 
transitions is always more consistent than that of appropriate murmurs. The 
effect of transitions vs. murmurs" in both sets of tests (la-2a, 1b-2b) is much 
higher for j>] (2.5-2.8 ratio) than for [n], M, and for [n] (1 .4-1 .7 s ratio), 
than for [*)) (1.1-1.6 ratio). 

Tests 1a and 1b show that an optimal [ju] murmur does not favor the 
identification of one or another of the _place categories.^ An optimal [n] 
murmur slightly favors identification of [nj, M vs. [^J (see test 1b). An 
optimal murmur favors significantly [rj ] vs. [n] responses (1.5-1.9 ratio). 

Tests 2a and 2b show that the presence of optimal [^] transitions 
correlates significantly with [9] vs.tn] responses (1.1-2.1 ratio) while that 
"of^yRim~M UJ vs - L9J responses 

(1.1-2.1 ratio), independently of -M identif icationr - Appro priate^L]tJ_Jransi- 
tions contribute exclusively to [jj identification in a range of 90. 9^-95. !%• ' 

2io 
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Table 7 



v Category judgments for ea~h test (percentage). Data fr&jn all 
24 subjects and ^the best H label ers are displayed separately. 



Ljl 1*1 ' .LjJL 



Test 1 a 
[^] murmur 
[n] murmur 
[j] murmur 



40.8 


26.6 


32.6 


42,7 


24.9 


% 32.2', 


30.3 


32.9- 


36.8 


28.4 


34.5 


37 


33 


31.2 


35.8 


♦32.3- 


34.4 


33.3 


t 

•44.1 


1 23.6 


32.3 


45.1 


22.6 


32^2 


26.7 


37.1 


36. J 


23.'3 


40.4 


36.3 


32.5 


31.5 


36 


32.3 


34.7 


33. 


■ 63.3 


35.3 


0.4 


68.4 


31.1 


0.5 


46 


51.2 


2.8 


39 


60.4 


0.4 


1.5 


2.8- 


95.7 


0.6 • 


4.3 


95.1 



' Subjects 



(1 )A11 24 subjects 
(2)Best. 14 label lers 
(1) 
(2) 
(D 
(2) 




(D 
(2} 

.(•1 ) 
(2) 

(D 
(2) 



• Test 2a 



[y] transtions 
[n] transitions 



transitions 



/ 



(D 
(2) 
(D 
(2) 
(D 
(2) 



Test 2b 

[^] transitions 
[n] transitions 
[j-] transitions 



53 


45.4 


1.6 


54 


45.9 


0.2 


37.9 


54.8 


7.1 


30.4 


65.7 


3.9 


2.8 


4.1 


93.1 


2.6 


6.5 


90.9 



(1) 

-(2) 
(D 
(2) 

(1) 
(2) 




io ' 2 4 ft ft 10 2 ' 4 • « io 

Figura 6>P«rc«ptual results for tasts 2a(abova)and 2b(balow)(aH 24 Catalan tubjactt). 
Ordinata:parcantaga of ratppnsas with Indicated labal; abscissa: stimuli. 
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Transitions 




2 4 6 8 10 2 4 6 8 10 2 4 6 .8 10 

Rcurs 8.-P*rcsptual rssuHs for tssts 2a (abova) and 2b (balow) (14 baat Catalan labalara). 
Ordlnata:parcantags of raaponaaa with hidicatad labal ; abaciaaa:atMiull. 
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b. Tests 1a, 1b (Figures 4 and 5)-Identif ication peaks show that 
category judgments cannot be predicted on the basis of appropriate murmurs but 
of appropriate transitions, especially for the [n] and [p] murmur conditions. 
I characterize below an optimal set of F2-F3 transition directions ariti range 
values (in Hz) for each place category, according to perceptual data reported 
by all Catalan speakers with special reference to the most consistent 
labelers: 

[7] F2: slighty negative to slightly positive (-70 to +§0) 

F3: strongly negative to steady (-300 to 0 ) 

(+140 to +250) 
( 0 to +200) 

(+280 to +420) 
(+200 to +500) 

Murmurs appropriate for different categories have no effect, upon optimal 
transition values for [^] and [y0. While [jvl murmur has no effect upon 
optimal [n] transitions, a significantly higher average of [p] than [n] 
judgments is 4 obtained for typical [n] formant values (F2: up to +260; F3: up 
to +230) followed by Ljl murmur. 

b. Tests 2a, 2b (Figures 5 and 6 ^Identification peaks • show that 
category judgments can be highly predicted on the basis of appropriate 
transitions. This is clearly the case for the l]d , transition condition: 
while no optimal [%] murmur can be found along different murmur continua, 
Optimal [rO transitions override completely tn] and [9] murmurs. Therefore, 
'an optimal set of murmur values (in Hz) for [9] and In] is to be exclusively 
found in the cake of the Lpl and [n] transition conditions: 

[n] N1: 350 N2: 1200 N3: 2100 N4: 2500 NZ:* 3200 

tn] |M1: 250 N2: 960 N3: 1800 N4: 2700 NZ: 208C 

The perceptual effect of these optimal murmur values is obvious: "for LJ1 
and [n] transitions the percentage of [0] responses increases as the optimal 
t^] murmur (stimulus 11) is approached and that of [n] responses also rises 
towards the optimal [n] murmur (stimulus 3). It is the case that optimal 
murmurs for M land [n] are shown to be dependent upon [n] and [J] transitions 
respectively:^ thus, the percentage of [p] responses for the optimal [?] 
murmur is significantly lower with [n] than Ipl transitions and that -of In]- 
responses Tor thfe v optimal [n] murmur is significantly lower with [p] than [n] 
transitions. Moreover, it must be noticed that, while [n] transitions never 
override optimal] [^] murmurs, [p] transitions are shown to override optimal 
[n] murmurs in test 2a (Figure 6). 

C. ^ Summary and Discussion 

Perceptual data from Catalan subjects indicate that, overall, for vowel 
[a], transitions provide more effective pues for nasal consonants of different 
place of articulation than murmurs. This is consistent with results obtained 
in previous experiments with synthetic speech tested with American English 
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[n] F2: positive 

F3: steady to positive 

[Jul F2: strongly positive 

F3: strongly positive 



speakers. In agreement with Henderson's (Note 1) results, it has been shown, 
however, that the contribution of murmurs in place identification is much 
higher than that suggested by data reported in most of those experiments, and 
category-dependent as well. In fact, my results confirm that a better 
characterization of place cues, for nasals must be strongly related to a 
considerable , improvement in the construction of the experimental paradigm 
selected for perceptual testing. I draw next a summary of the perceptual 
results on Catalan and evaluate them in the light of material reviewed in 
Section 2.2. 

' While, as stated, transitions are more powerful cues than murmurs in the 
process of identification of nasals, important cross- category differences can 
be established. This effect is highly relevant for [jvj, and more relevant for 
[n] than [jj. ^Reciprocally, [j] murmurs contribute more to place identifica- 
tion than other murmurs. This contrast in perceptual relevance of cues is 
consistent with the tendency of [9] murmur to prevail over [n] transitions and 
not vice versa in what may be called an inter- category trading relation as 
characterized below in this discussion section. On the other, hand, while [y] 
transitions override murmurs appropriate for other nasal categories, optimal 
[j>] murmurs have been shown to convey no place information. This negative 
effect may have been maximized by not having taken into consideration, in the 
synthetic reproduction of murmurs, the characteristic [j-] glide component 
during., murmur 1 Spectrographic analysis reveals that, while very little 
movement can be detected for nasal formants during the closure period in the 
case of [n] arid [9], those for [3*] sHow a continuation of positive excursion 
with respect to F2-F3 transitions. Since F- transitions result from articula- 
tor dynamics, we have, apparently, a continuation of tongue movement during 
lingual, closure and complete oronasal coupling. All these findings about 
, relevance of transitions and murmurs in the identification of different place 
categories are significantly consistent with the summary of interactive 
perceptual effects included in section 2.2.C., based upon production and 
perception data from other languages. 

Optimal cues obtained for [9] indicate that transition .direction for F2" 
is not perceptually relevant as long as it remains close to the vowel steady 
state frequency; however, F3 transition must be negative for a satisfactory 
[n] identification. Optimal [<0] murmur, on the other hand, ,is characterized 
by a high N1 and the absence of NZ at the central part of the spectrum. 
Identification of [n] is mainly dependent upon a constantly positive F2, for a 
steady or positive F3*. Optimal alveolar murmurs have a low N1 and, an NZ 
between H3 and, N4. .These results for and [n] conform well with 

indications about perceptually relevant acoustic cues in sections 2. 2. A. and 
2.2.B. according to data, from other languages. [}*] has been found to be 
exclusively dependent on strongly positive F2-F3 trans \tions in agreement with 
reported perceptual data from Polish and cues for Russian palatalized non- 
nasal consonants. This powerful transition effect also confirms suggestions 
made- in section 2.2.B. about the possibility of a identification for a 

strongly positive F2 transijipj^n the absence of appropriate [9 J cues by 
speakers of languages with tfjT Moreover, the fact that no perceptual effect 
follows the contrast in N3-N4 values between []*>] murmur and LnJ, LpJ murmurs 
(Figure 1 ) °is consistent with the irrelevance of high formants at the murmur 
portion in place identification of nasal murmurs and, therefore, with the 
perceptual significance of N1 and NZ values^ In this summary of cues one must 
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point out that the -inclusion in the experimental paradigm of different F1 
transitions for contrasting nasal categories might have added some relevant 
information about place identification, , * 

Data in Figure 7 show to what extent the perceptual results are 
consistent with production measurements gathered from Catalan speakers. 
Crosses along both diagonal lines point to values for F2-F3 transition ranges 
corresponding to synthetic stimuli ot tests 1a and 1b. Phonetic symbols 
recorded on these lines indicate prevailing- interstimuli category judgments 
for transition continua in all different murmur conditions. Dots stand for 
F2XF3 transition range values corresponding to productions of [m], [n], [y^ 9 
[•)] by single Catalan speakers summarized in Table 6; they are grouped into 
production spaces for each nasal category. Transition range values for the 
reference Catalan speaker chosen to prepare the synthetic speech stimuli are 
represented by encircled dots. Such a graph has been fouhd to accord more 
with perceptual processing of nasals than one that relates stimulus points to 
production data on transition endpoint values: A comparison of results from 
tests 1a and -1b has shown that, in -categorizing stimuli, listeners were in 
fact attending to F2-F3 transition ranges relative to the vowel steady state 
value and not to absolute transition end points* A satisfactory coincidence is 
found between perceptual category judgments and category jproduction spaces for 
different nasals, thus confirming the fact that transitions are good identifi- 
cation place cues. While this is clearly the case for the perceptual contrast 
between [n] and [j»] f 7 it can be seen that' murmur structures ([^] murmur for 
ty] identification; [n], [yj murmurs for [n] identification) are used by 
Catalan speakers as identification cues for F2-F3- range values that lie 
somewhere between or on the edges of [j] and [n] production spaces. This 
finding argues for the existence of a trading relation between acoustic traits 
spread over time in the process of [p] vs. [n] identification, and shows that 
acoustic cues are integrated into a unitary phonetic percept in the process of 
dynamic perception. Thus, perceptual complementarity between transitions and 
murmurs accords with the fact that, given an ambiguous set of F2-F3 formant 
transitions between [^] and [n] production spaces, listeners report /jj}/ or /n/ 
judgments depending on whether the following murmur structure is appropriate 
fbr [^] or [n]* respectively. Moreover, consistent with reported observa- 
tions!, [^] murmur appears to have greater perceptual weight than In] murmur, 
since the perceptual range for [j] affects more considerably the [n] produc- 
tion space than that for [n] the fy] production space. 



4. CONCLUSIONS - 

In the previous sections I have investigated the .interactive role of 
formant transitions and unreleased murmurs in the process of identification of 
[n], [v], [9] with [a] in VC syllables using synthetic speech stimuli. 
Perceptual results from Catalan speakers strongly suggest that syllabic cues 
such as transitions and murmurs are simultaneously processed in a phonetic 
mode. As Studdert-Kennedy (1977) has made clear, dynamic acoustic events "are 
jointly shaped by the timing mechanises of motor control and by the demands of 
the auditory system for perceptual contrast and compression" (p. 17). As 
exemplified below, production and perception data reported in this paper show 
that thero is evidence for both related strategies in the process of 
identification of place for nasals, namely, reference to articulatory 
constraints and to constraints imposed by the auditory system. 
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Figura 7.- Comparison of porcoptlon (Figuraa 4,6) and production 

(TABLE 6) data for Catalan aubjacta with apodal rafaranca 
to formant tranaitlont.Ordinata: F2 rangas (in Hi);abaciaaa: 
F3 ranga* (In Hz). 
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Reference to psycho acoustic constraints imposed by the auditory system is 
presumably needed to account for specific acoustic cues that make transitions 
and murmurs perceptually relevant. On the one hand, it would argue for the 
correlation between amount of V2 transition range and perceptual relevance of 
F2 transition found to be true for nasals with respect to the arrangement [?>]> 
[ n ]> [j]* An explanation for this effect is suggested by Klatt and Shattuck 
(1975). They found in experiments with non-speech stimuli that the perceptual 
importance of an F2-like chirp with respect to an F3-like chirp is positively 
correlated with its frequency height. That is, the effect would increase with 
an F2 transition such as that for (strongly positive) with respect to [n]. 

and such as that for [n] (moderately positive) with respect to [^J 
(slightly positive but also slightly negative or steady) . No auditory 
constraint is known that can handle perceptually distinctive cues of L 0 J vs. 
[n] murmurs. 

An auditory analysis of that sort is compatible with a perceptual 
processing mechanism of relevant acoustic cues that keeps track of the 
underlying unitary articulatory gesture. In fact, it is to be thought that 
constraints imposed by the auditory mechanism in the interpretation of cues 
are integrated at a more central stage with reference to a dynamic and 
continuous set of coarticulation strategies. Evidence for such related events 
can be derived from Figure 7: according to data displayed there, nasals are 
perceived with reference to well-established category spaces and essentially 
processed, at least for [a], upon F2-F3 transition .range values and upon 
murmur characteristics for potentially "ambiguous" transition configurations. 
Reference to the continuous production event is also needed to account for the 
perceptual decoding of syllabic spread cues: thus, as shown, transitions and 
murmurs (and, presumably, releases) are evaluated *. simultaneously and 
complementarily in a way that a cross-category maximum to minimum perceptual 
effect for transitions ( []^]>[n]>[ ) corresponds invariably to a minimum to 
maximum effect for murmurs ( Lj*]<[n]<[p] ) ). Such reciprocal correlation 
conforms to the existence of A trading relation between transitions and 
murmurs* for [9] and [n] with [a] in Catalan (section 3.3.C.) and a defined 
compensatory effect between strongly positive []\] transitions and perceptually, 
irrelevant [>] murmur. Further evidence for simultaneous processing of nasal 
cues at the syllabic level according to vowel quality has been reported to be 
true for [^] preceded by [*], [fc] (section 2.2. C.) . 

A perceptual model that allows this sort of auditory analysis of 
transition ranges to occur presupposes reference to a basic set of articulato- 
ry gestures but is not compatible with a feature recognition model based upon 
the auditory detection of invariant short-term spectral properties. Blumstein 
and Stevens (1979) report properties of this sort at a 6 msec window release 
period of [mj (diffuse- falling frequency-amplitude template) vs. [n] (diffuse- 
rising frequency-amplitude template) in initial position. Frequency-amplitude 
spectral characteristics at the release can be hardly thought to be "primary 
place cues" since unreleased nasals are equally common and occurring releases 
may be perceptually weak, almost indistinguishable; moreover, murmur spectra 
can hardly provide such cues, given their low amplitude component, high 
variability and particular pole-zero structure. In fact, transitions have 
been' found in my experiment to be the best place cues in combination with 
appropriate murmurs: overall, gliding F2-F3 transitions have given more than 
35% of [jj judgments and optimal transition-murmur combinations 80% of LnJ and 
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ty] responses. While, in order to handle these aspects, the perceptual model 
proposed by Stevens and his . coworkers can be shown to be too simple and 
limited, it becomes too complex and complicated on other grounds. Thus* 
contrary to what has been suggested by Stevens (1975), examination of long 
stretches of acoustic data (presumably hundreds of msec long) before phonetic 
feature decoding begins is njt needed in the case of diphthong- like spectral 
nuclei such as palatal and palatalized articulations: short pre-closure 
transitions are satisfactory cues for with [a] and can be processed in the 
same way as those for [n] and [^]. 

I have argued for a perceptual process of nasals based upon the 
simultaneous integration of acoustic cues according to demands imposed by the 
production and auditory systems. But, given [a] and other possible vocalic 
environments, what is the articulatory basis for this integration process? 
Little perceptual data on the identification of nasals of different pieces of 
articulation in different vocalic environments is available. The only system- 
atic approach %a that of Henderson (Note 1). Henderson's data and evidence 
provided in this paper support the view that the perceptual interpretation of 
transitions and murmurs for nasals preceded * ' different vocals is similar to 
the integration of burst and transitions foi aon- nasal stops of the same place 
of articulation in CV environments (Doraan et al.~xf977). Thus M murmurs 
are optimal cues and [nj transitions very weak cues for [i], [a]/[0], [o], 
[ul; for [e], [&] the role of transitions becomes more relevant, while for 
[e] — but not for [t] — that of the murmur decreases. For [n], transitions are ' 
very effective cues with back vowels but very weak with [i] and [e] tfhile 
murmur is, complementarily, a better cue for [i], in accordance with earlier 
findings (Hecker, 1962; Nakata, 1959; Ohala, 1975). General effects for [n] 
are to be expected also for [>] and ought to be correlated with long 
transition duration with back vowels vs. short transition duration and little 
excursion range with high vowels (see, for Polish, Jassem, 1964). 

While the perceptual relevance of vowel- to- consonant transition ranges 
for nasals accords well with data from Doraan et al. (1977), that of murmurs 
is only consistent for [j] with all vowels but [*•] and for [n] with [i]. 
Differently from alveolar bursts, for [n], strong murmur effect is found in 
the case of [o], [fi], [u] and less for [a], [e], [»]. Such findings about the 
category identification role of .murmurs in different vocalic environments 
suggest that the interpretation strategy used by listeners in associating 
murmur and articulatory event differs from that proposed by Doraan et al. for 
burst, avd. front cavity size. v In the case of final nasals, different 
articulatory conditions argue for different integration strategies: no burst 
is present and release is weak and unnecessary; spectral continuity cannot be 
expected between oral transition endpoints and oro-nasal murmur concentrations 
characterized, moreover, by a low perceptual relevance of the mid spectral 
regions; finally, energy concentration and crucial place information in nasal 
r.urmurs are dependent on the size characteristics of the oro-nasal system™ 
behind the tongue constriction point (back cavity); A plausible integration 
model or Henderson'^ data would* associate the back cavity for the nasal 
consonant with the overall front-back cavity system for the vowel so that the 
perceptual effectiveness of the murmur would depend on the degree of similari- 
ty between back nasal cavity and front or back cavity size appropriate for the 
vowel. Such a model would predict perceptual relevance for [jj murmur with 
the back oavity size of coarticulated back and front vowels, for [m] murmurs 
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with the considerable front cavity size of back vowels, and for tyi murmur 
with the wide pharyngeal pass of palatal vowels. For Cn] murmur there would 
4>e integration with the overall tract system of a mid vowel and the front 
cavity of [il. Obviously, more experimental evidence is needed. In any case, 
data frotc Henc ♦ 'son (Note 1 ) and experiments reported in this paper show that 
transitions and murmurs, analogously to bursts and transitions, are equivalent 
and complementary. 



REFERENCE NOTE 

1. Henderson, J. On the perception of nasal c onsonants . Unpublished Gener 
als Examination paper, University of Connecticut, 197$. 
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FOOTNOTES 



* - . • . 

- h represent with [>] palatal as weU as palatalized nasal stops. Velar 
nasal stops are consistently^rfeyrese^ted with^J independently of their 
phonological status. * * 

• • * 2Re search is to be" done, on the Relative differences in pharynx cavity 
~size and size of vflum opening amcmg nasals of different place of articula- 
tion. 5 It deserves to be 3een to what extent the acoustic structure and the 
perceptual . role -of their murmur fonnants are affected by those dif ferences.* 

3The reliability of < the results remains open .to the objections raised in 
ruction 3.3. C Furthermore, • possible bias effects can be related to the fact 
that all experiments were" forced,- choice. AlW, the fact that^American English 
* subjects in Mal^cot* s experiment gave 80% of [%] responses- for original J 
while )00% of [m] and [n] responses /or original [jn] and [n], respectively, 
suggests response bias effects against the velar correlated That American 
English speakers identify [o] after [aj very reliably has been shown with 
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original, natural speech stimuli by Henderson (Note 1) OOOJ) and Zee (1981) 
(96%). 

« 

• * > 

call transition range value the amount of frequency contrast between 
steady state vowel and starting point or endpoint value of an adjacent formant 
transition. It is expressed in Hz and can be positive (♦), negative (-) f or 
null (=). ' « 

SAcoustic measurements of zeros were inferred from frequency areas that 
show a major reduction in the magnitude of the energy envelope as observed in 
spectrographic spectral sections. A final decision about zero frequency 
values ^to be included in tl>e synthetic speech patterns used for- perceptual 
.testing was also reached 'on, the basis of "measurements reported in the 
literature, as well as well-accepted observations on formant-antiformaot spec* 
tral characteristics of nasal murmurs corresponding to different place catego- 
ries (see Sections 2.1 .B. and 2. 2. A., and Table 1). \ 

: » 

6 Bandwidth valines' were estimated by measuring the distance between . t#o 
points at the right and left side of the spectrum envelope; equally 'located 
3dB below the peak level^ ^ 

?The mismatch between perceptual stimuli and frequency values correspond- 
ing to the []t] production space (Figure 7) did not affect the quality of [Jt] 
Judgments, thus suggesting- that subjects p%rceiye a positive F2 transition as 
a palatal cue when pointing to a critical high locus. 
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I.- - INTRODUCTION 



One of the most devastating effects of congenital hearing loss is that 
normal* development of speech is often disrupted. As a consequence, most 
'hearing-impaired children must be taught tt*$ sj^ech skills that normal-hearing 
chi'MreiH^readtly^acquirevduring the first few years of life. Although some 
nearing-impaired Children develop intelligible speech, many do not. For many 
yeans, it was be'U?ved that profoundly hearing- impaired children were incap'a- 
-ble^of learning to talk. Carrying this belief td the extreme, Froeschels. 
(1932) even suggested that all deaf children exhibited some. behavior problems", 
"due td $he fact that the profuse motor release connected with speech is 

impossible io their cade M (p. 97). % 

. . * *i * * * 

Within the last decade, Advances have been made in Studying the speech of 
jfche hearings-impaired. , This is largely due to, the development of sophisticated 
processing and Analysis techniques in speech science, electrical engineering, 
• and computer science that have increased- our knowledge of normal, speech 
production. In turn/ these technological advances have .been applied to the 
analysis of the speech of the hearing impaired, and also to the development of 
clinical, assessment and training procedures. 

- The 7 oral communication skills of hearing- impaired children haVe long been 
of concern to .educators of »the hearing impaired, speech pathologists, and 
audiologists because the'/ adequacy of such skills can influence the social, 
educational,, and career -opportunities available to these individuals. Since 

' the introduction of PL 94-142 and the emphasis on mainstreaming, there is an 
even greater likelihood that many professionals will need to- learn about, or 
upgrade their - knowledge of, the speech of hearing-impaired children. The 

0 intent of this chapter is to provide the clinician, student, and researcher 
with a ^comprehensive description of ;the speech characteristics of this 
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population. It is assumed that the reader has' some familiarity with the 
effects of 'congenital hearing loss on speech and language development, and has 
some exposure to acoustic and articulatory phonetics. It should be noted that 
most of the information available about the hearing impaired is concerned with 
children with severe and profound sensorineural hearing losses (losses. of 70 
'dB'HTL or greater). In comparison, relatively little is known about the 
speech of hprd-of-hearing children (losses less thaa -70 . -dB HTLh_.lt is._f.or__ 
this reason that most of the chapter is devoted to children who are severely 
and profoundly hearing impaired. 

In order to present an in-depth coverage of speech production processes, 
we have opted to discuss language skills only in 'those instances where there 
is no clear-cut separation between language and speech. Likewise, the 
auditory skills of the hearing impaired will be discussed only to the extent 
that factors" such as hearing level and auditory capabilities affect speech 
production skills. The emphasis on speech production is not meant to suggest 
that an aural/oral teaching method is the only appropriate educational plan 
for hearing-impaired children. The issues involving educational methodologies 
are not ef primary concern here, father, it is the belief of the authors that 
every hearing-impaired child is entitled to speech training services, even if 
a realistic goal of such training may be only the . development of functional 
(survival)^ speech skills. .Before optimal teaching strategies can be selected, 
however , Tteachers and clinicians must have a thorough understanding of the 
nature of^the problems they are trying to remediate.* 

II. DEVELOPMENTAL ASPECTS OF THE SPEECH OF THE HEARING IMPAIRED, 

A. Vocalization Patterns 

For many years, it was believed that the vocalization development of 
hearing and hearing-impaired infants was the same, at least through the 
babbling stage. After this period, hearing-impaired infants were reported to 
stop babbling. This no'tion was based pFimarily on Mavilya f s (1968) data, 
which showed a marked . decrease in the number of vocalizations produced by 
three cpngenitally hearing-impaired infants (T2-16 weeks old at the start of 
the study) over a three-month period. Recent data obtained by Stark (in 
press) do not support the findings of Mavilya. For a group of hearing- 
impaired infants 15-24 months old, Stark observed an overall increase in rate 
of vocal output with age, The mean number of vocalizations was also observed 
to increase as progressively higher levels of vocal output were attained by 
the infants. In general, thri stages of vocalization behavior of the 15- to 
2'4-aonth-old hearing-impaired-anfants were similar to those 6f a group of 
normal-hearing infants 9-48 weeks of age. An important point that should be 
made is' that the speech behavior of the infants in both the Stark and Mavilya 
studies was recorded before the children had been fitted with hearing aids. 
Stark found that the level of vocal development reached by the children befpre 
they were .fitted with amplification did not appear to predict their later 
progress, in .learning Speech. 'The vocal development of some children pro- 
gressed rapidly after they were given hearing aids, while the vocal develop- 
ment of others did not. 
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Although Stark found no difference in rate of vocal output between the 
normal-hearing and hearing-impaired infants, differences in the phonemic 
repertoire were found between normal infants and hearing-impaired infants who 
were judged to be at the same level of vocal development. Syllable shape 
(e.g., CV, VC, CVC, etc.) was similar among the children, but the inventory of 
vowel- or consonant-like sounds was more limited in the samples and tended to 
be more stereotyped than those of hearing infants of the same age. Mavilya 
also observeo^thl* the "pKortemrc~aSpieetiS-of the-vocattzattons-of-the-hear-ing-— 
impaired infants in her study were different from those reported for infants 
with normal hearing. Specifically, Mavilya observed a severe delay in the 
"development of consonant sounds in the vocalizations of the hearing-impaired 
infants, with vowels produced more often than consonants. 

In an earlier study, Stark (1967) analyzed the phonemic aspects of the 
vocalizations of six congenitally hearing-impaired children betweetf the ages 
of 16 and 19 months before they were fitted with hearing aids. Analysis of 
the "infants' vocalizations revealed that the following sounds .were used by all 
six babies: (1) a low front vowel, such as W\ (2) a i eutral """^l °«" 
schV (3) an aspirant /h/, which could precede or follow vowel. sounds; <4> a 
syllabic nasal consonant usually identified as /ml, and (5) 'a glottal stop. 
An interesting observation made during this study was that the 
vocalizations of the hearing-impaired infants, Such as wh luring, ilgUnf, 
crying, and laughing, did not sound deviant, and therefore this aspect of . 
- vocal behavior did not provide diagnostic information about the hearing status 
of infants. 

In summary, the results of Stark's (1967, in press) research do not 
support the belief that hearing-impaired infants simply cease vocalizing after 
the babbling stage. Differences between the vocalizations of nornl-hearlng 
and .hearing-impaired infants do emerge at an early age, but the d J"V«.°JJ 
are seen in phonemic production rather than quantity of vocal output as 
suggested by Mavilya (1968). \ 

' B. Speech Sound lTTventories 



* FWtic inventories have been obtained from the spontaneous samples of 
hearing-impaired children ranging from eleven months to seven years of age 
(Sir? 1953; Lach, Ling, Ling, & Ship. 1970; Stark, in press; Sykea. 1940, 
West & Weber, 1973). Although these studies report *?f*™Z" J* JJ« 
frequency of specific vowel sounds in the samples °' ^ 1 - ^^ lr ^ h J^5 
Btudied. the pattern of vowel production is remarkably similar. The vowels 
mpst commonly used by young hearing-impaired children m in ^%^' G ^ r l) 
vowels /a 9 I and the low front vowels /£,ae/. The extreme high vowels /i, u/ 
occu red^e^atrvely infrequently in the children's samples The exception to 
this pattern was reported by Carr (19534. whose five-year-old hearing-impaired 
sublects used a wider range of vowels than noted above. There is some 
evince that this patterh of vowel usage 6hanges over time... For example 
laoh et al (1970) found that over a one-year period, young hearing-impaired 
chUdren 11^2 months of age, who were enrolled in a preschool program 
tended £o shift from the frequent use of the schwa vowel to other vowels, with 
tie i Greatest Increase in usage observed for III. Parr (1953) also compared 

h Velltue f™y of each vowel type in the : speech of ' ^e-year-old 
hearing-impaired children to that of hearing children and noted that the 



hearing-impaired children used vowels in a manner .,and degree similar to 
hearing infants of 11 to 12 months of age. ,The hearing-impaired children were 
also found to use vowel, sounds more often than consonant sounds. In another 
study, Sykes (1940) found that 4- to 7-year-old hearing-impaired children 
produced almo3t half of their vowel sounds in isolation and not in combination 
with a consonant. 

,-Analyses of consonant production have-shown that -young hearing-impaired 
children produce front consonants /b t p t m f w/ more often than they produce 
back consonants (Carr, 1953; Lach et al. f 1970; Sykes, 1940), and they have 
been found to use front consonants with greater frequency than do hearing 
. children (Carr, 1953). In a longitudinal study, Lach et al. (1970) analyzed 
consonant usage <• by manner of production Before the children, began a 
preschool program, 66% of all consonants produced were glottal sounds, and 
approximately 25% of the sounds were nasal consonants. After one year in the 
program, the glottal sounds were used only 44% of the time. There was also a 
large increase in the usage of plosive's and semi-vowels, due primarily to an 
increased use of /b/ and /w/. 'Fricatives and affricates were used only 
rarely, even-' after one year of training. With only one exception, all 
children produced a significantly greater number of consonants and vowels 
after -orte year of training, with a concomitant increase in the consonant-to- 
Vowel ratio. 

C. Phonemic and Phonologic Skills 

A limitation of the simple sound-type inventories, which were discussed 
above, 13 that no information is provided on the phonological usage of the 
speech segments. A tally of consonants &nd vowels does not reveal whether or 
not the phonemes were used appropriately. To overcome this' limitation, 
investigators have begun to perform phonemic, phonological, and linguistic 
analyses of hearing-impaired children's speech (Oiler, Jensen, & Lafayette, 
1978; Oiler & Kelly, 1974; Stoel-Gammon, in press; West & Weber ,• 1973) . 

Recently, a comprehensive study was performed by Stoel-Gammon (in press) 
in which cross-sectional and longitudinal data were obtained on phonological 
acquisition by hearing children, 1.5 to 3.10 years of age, and hearing- 
impaired children, 2.4 to 7.3 years of age. The cross-sectional data showed 
that, in large part, the patterns of development were similar for the two 
* groups of children, although the rate of development was considerably slower 
for the hearing-impaired children than for the hearing children. Similar 
patterns of correct production and error types were present for both groups of 
children. The set of substitution patterns common to both groups included 
voicing of initial stops, devoicing of final stops, fricatives, and affri- 
cates, and substitution of homorganic stops for fricatives. When errors were 
common to both groups, they were more" frequent in the speech of the hearing 
impaired than in the speech of the normal-hearing children^ 

Some differences in the pattern of development between the normal-hearing 
and hearing-impaired children were also observed in the above study. Errors 
found to be present only in the hearing-impaired children's speech were: 
substitution of a glottal stop for the target phoneme, substitution of the 
palatal fricative /J7 for the affricates /t$ and d3/, and substitution of 
consonants /h, k, g/ for other non-labial consonants. The only substitution 
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that Stdel-Gammon found to occur in the normal children's productions that did 
not occur in those of the hearing-impaired was depalatization of t), d3/ t 
resulting in a substitution' of /s/ for /J/ or /ts/ for /t^ , djf/. The data 
also showed that the substitutions of the hearing-impaired' children deviated 
farther from the target phoneme with respect to manner and place of production 
than^did the substitutions of the normal children. In addition, the errors of 
tfiehearing-impaired subjects also tended to show a larger range of substitu- 
tion types, for example, /k, g/ for /tj/, than those made by the hearing- 
children* v 

The longitudinal data obtained by Stoel-Gammon revealed that the hearing- 
impaired children progressed toward correct production of target phonemes at a 
much slower rate than the normal-hearing children and that there was a much 
greater range and variation of response * types t both within and across 
subjects. The preliminary data suggested that the hearing-impaired children,, 
passed through three developmental stages. In the first stage, the child 
produced <a wide variety of substitutions for the target phoneme* In the 
second stage, there was a narrowing of the range of substitutions, followed by 
substitutions with a single sound. ( In the third stage, the phoneme was 
produced correctly* Of course, not all hearing- impaired children progress 
through the third stage, as evidenced by numerous pho»etic errors that remain 
in the speech of many hearing-impaired persons even throughout their adult 
life. 

Additional research is needed in order to delineate the stages of speech 
acquisition in hearing-impaired children. This information is essential to 
help us better understand why some children develop intelligible speech and 
others do not. Although thfere are data suggesting that hearing-impaired 
children are simply delayed in phonemic acquisition (Oiler et al., 1978: Oiler 
& Kelly, 1974; Stoel-Gammon, in press), we also know that there are differ- 
ences in the phonology used by hearing children and hearing- impaired children. 
In fact, there are noticeable differences between the production patterns of 
the two groups of children at a very early age, anc* the speech of some 
hearing-impaired children never progresses beyond the very early stages of 
development. As we shall see in the following section, the speech production 
patterns of older, hearing-impaired children show many similarities to the 
patterns Qf the younger hearing-impaired children. It will also become 
evident that although in many cases, hearing- impaired children fail to follow 
rules 4 typical of normal speech, the deviations in their speech show systematic 
patterns, indicating that they are using a set of phonological rules, even 
though these rules may differ from those used by normal speakers. 

III. ARTICULATOR PATTERNS IN THE SPEECH OF SEVERELY AND 
PROFOUNDLY HEARING-IMPAIRED CHILDREN 



A. Production of Consonants 

1 . Overview 

Perhaps of all the speech production errors characteristic of the 
severely and profoundly hearing impaired, the area that has received the 
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greatest attention is that involving the articulation of consonants > vowels, 
and diphthongs. Numerous independent investigations (Hudgins 4 Numbers, 1942; 
Markides, 1970; Smith, 1975; McGarr, 1980) have been remarkably consistent in 
identifying typical articulatory errors in the speech of hearing-impaired 
children who were trained in many different programs. Most of these investi- 
gations are of a descriptive nature; that i*S, either listener judgments or 
phonetic transcriptions were used to obtain measurements of intelligibility or 
to describe the articulatory characteristics of the speech. However, some 
investigators (Calvert, 1961; Monsen f 1974, 1976b, 1976c; Rothman, 1976) have 
begun to derail some of the acoustic characteristics of the speech of the 
hearing impaired (e.g., voice onset time, closure duration, forraant frequen- 
cies). Acoustic analysis of hearing- impaired speech -permits a finer-grained 
consideration of some aspects of both correct and incorrect productions than 
would be possible using methods applied in .the descriptive literature. 

For purposes of organization we will consider the production of supraseg- 
mentals as well as other factors that affect the intelligibility of speech 
later in this chapter. This section will present information only on the 
segmental aspects of hearing-impaired children's speech. We will first 
consider the error patterns detailed in the descriptive literature and then 
discuss the relevant acoustic data for production of consonants and vowels by 
hearing-impaired speakers. 

2. Consonant Errors 

Any comprehensive analysis of the articulatory skills of hearing-impaired 
children must begin with the classic work of Hudgins and Numbers (1942). 
These authors studied 192 subjects between the ages of 8-20 years whose 
hearing losses ranged from moderate to profound. The students read simple 
sentences. From recordings, teachers of the deaf later evaluated the 
students f " productions for proficiency in articulation as well as rate and 
rhythm. Error categories were established for consonants, vowels, and 
diphthongs, and an attempt was made to relate these patterns to speech 
intelligibility. 

Briefly, the articulatory errors were divided into substitutions, omis- 
sions, and severe distortions of the intended phoneme as, well as the addition 
of adventitious phonemes or syllables. Among the more common error types 
involving consonants were confusion of the voiced and voiceless sounds, 
substitution of one consonant for another, added nasality, misarticulation of 
consonant blends, misarticulation of abutting consonants, and omission of word 
initial or word final consonants. This overall pattern of consonant errors 
has been replicated in numerous studies (Brannon, 1966; Geffner, 1980; Gold, 
1978f Levitt, Smith, & Stromberg, 1976; Markides, -1970;. Nober^ 1967; Smith, 
1975), although the actual percentage of errors • in any category m^y vary 
somewhat from stud/ to study. . 

a. Voicing errors . Errors of voicing were iDne of the most frequent 
types of consonant errors found by Hudgins and Numbers (1942). In subsequent 
studies, the direction of this error has sometimes been reported as occurring 
to the voiced member of the pair (Carr, 1953; Heider, ,Heider, & Sykes, 1941; 
Millin, 1971; Smith, 1975), and at other times*, to the voiceless cognate 
'(Mangan, 1961; Markides, 1970; Nober, 1967)- 
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Smith's (1975) study of 40 severe to profoundly hearing-impaired children 
enrolled in an oral school for the deaf has been among' the most comprehensive 
since Hudgins and Numbers, The 40 children read sentences containing key 
words that incorporated the most frequent English phonemes with transition to * 
and from the vowels /i/, lag/ \ and /u/ for all places of articulation'. Voicing 
errors were common for these children and most often involved substitutions of 
the voiced for voiceless member of the pair. Studies by Heider et al. (1941) 
and Carr (1953) have also reported a tendency for hearing-impaired children to^ 
use more voiced than voiceless sounds in their spontaneous speech. Indeed, 
Millin (1971V suggested that one manifestation, of the voiced for voiceless 
problem is inappropriate phonation evidenced at the beginning or end of an 
utterance. 

. This error pattern, voiced for voiceless substitution, is opposite to 
that found by Markides (1970) who studied 110 British hard-of-hearing and deaf 
children. The children produced words as part of an articulation test. A 
common error was substitution of the voiceless cognate for the voiced. Using 
the Templin-Darley Test of Articulation, v Nober (1967) analyzed production of 
phonemes by 46 seyere and profoundly hearing-impaired children. He reported 
that voiceless phonemes were produced correctly more often ..than voiced 
phonemes. Data obtained by Mangan (1961) can also be interpreted to show the 
difficulty hearing-impaired children have with voicing contrasts. Subjects in 
this study were reported to devoice final voiced consonants. - 

Taken together, these studies suggest that coordination of the articula- 
tors necessary for voicing contrast is an exceedingly difficult task for 
hearing-impaired speakers. Recently, some investigators (McGarr 4 LOfqvtst, 
£n press; Whitehead 4 Barefoot, 1980, among others) have begun t^ examine the 
physiological manifestations of some typical errors in the speech of the 
hearing impaired. Their data suggest that the nature of the voicing error may 
be far more complex than is suggested by the descriptive literature. In fact, 
some hearing-impaired speakers fail to coordinate the timing of respiration, 
phonation, and articulation in attempting to produce voicing contrasts. More 
will be said about these findings in a later section. 

b. Substitution errors : Place of articulation . Another common articu- 
lator error in the speech of the hearing impaired involves the substitution 
of one phoneme for another. Frequehtly, the substitution is tc a phoneme with 
v a similar place of articulation. There is general agreement that phonemes 
produced in the front of the mouth are more often produced correctly than are 
phonemes produced in the back of the mouth. This makes sense when one 
considers that the relative visibility of articulatory gestures should be 
important to hearing-impaired persons for whom there is reduced auditory 
information. * , 

Substitution errors involving the same place of articulation have been 
noted in several studies. Nober (1967) analyzed correctly articulated conso- 
nants according to place of articulation and then ranked them from highest to 
lowest scores as follows: ' bilabials, 59%; labiodentals, 48*; gloltals, 3 1 **; 
linguadentals, 32%; -lingua alveolars, 23%; linguapalatals, 18%; and lingua 
velar3, 12%. Similar patterns of correct production have been reported by 
Smith '(1975) and Gold (1978) except these -investigators found that sounds 
produced in the middle of the mouth were more prone to error than were sounds 
produced in the back of the mouth. 233 
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This general trend— better production for more visible phonemes — has been 
found not only for production of isolated words and sentences (Huntington, 
Harris, & Sholes, 1968; Geffner 4 Freeman, 1980; Levitt et al. t 1976; Levitt, 
Stromberg, Smith, 4 Gold, 1980; Smith, 1975), but also for spontaneous speech 
(Carr, 1953; Geffner, 1980; Heider et al.,~1941). 

Some caution should be exercised, however, in interpreting the importance 
of visibility in and of itself as a key factor in production. Some 
articulators, such as the lips, although quite visible, are also relatively 
more constrained and thus permit fewer possibilities for errors than other 
articulators such as the tongue. , Later we shall discuss some physiological 
data obtained by Huntington et al. (1968) and McGarr and Harris (1980), which 
is pertinent to this issue. 

c. Substitution errors : Manner of articulation . A common observation 
arises from an analysis" of consonant errors according to place of articula- 
tion. Hearing-impaired v speakers tend to position their articulators fairly 
accurately, especially for those places of articulation that are highly 
visible, but fail to coordinate properly the movement of the articulators 
(Huntington et al., 1968; Levitt et al., 1976). The type of consonant 
substitution that occurs in these cases is often described as one resulting 
from . incorrect timing. These errors are also described as involving an 
inappropriate manner of articulation. r * 

One example of a common error is the nasal-oral substitution. According 
to Hudgins and Numbers (1942) t errors in nasality may be considered to be a 
segmental problem and also a problem affecting voice quality, although here we 
are interested primarily in the former. Non-nasal phonemes were reported by 
Hudgins and Numbers to be nasalized, and nasal consonants were often produced 
as 3tops. Similar findings have atf.so been noted by Markides (1970), Smith 
(1975), and Stevens, Nickerson, Boothroyd, and Rollins (1976). 

Other errors in manner of articulation have also been noted. Smith 1 s 
hearing-impaired children made most errors producing the following: "palatal 
plosives, fricatives, affricates, and the nasal /}/. Glottals were frequently 
substituted for stops. Fricatives showed a high rate of substitution to, but 
not from, the plosives. Affricates were never substituted for other conso- 
nants, but tended to be substituted by one of their components, usually the 
plosive component. However, bilabial plosives, the glides, and the fricatives 
/f/ and /v/ were often produced correctly. Nober f s (1967) results also 
followed the general pattern reported by Smith. Glides were most often 
correct, followed 1>y stops, nasals, and fricatives. Similar findings were 
obtained- by Geffner and Freeman (1980) for 67 six-year-old severe and 
profoundly hearing-impaired children attending schools for the deaf throughout 
New York State. . v 

The ' articulatory movements for both alveolar and velar sounds are 
visually obscure. One reason why alveolar sounds may be more prone to error 
s than velar sounds is that mo *e sounds are produced in the middle than in the 
back of the mouth. Because of this, precise positioning of the articulators 
is necessary in order to differentiate correctly all the sounds with a medial 
place of articulation. Thus, greater variability in articulatory placement 
can be tolerated before the velar sounds are misperceived by the listener. In 
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any *vent, a consistent finding is that hearing-inpaired children correctly 
produce the highly visible phonemes (i.e., those produced in the front of the 
mouth) more often than those phonemes that are not articulated with a high 
degree of visibility (i.e., those produced in the middle or back of the 
mouth). 

- d. Omission errors . By far the single most frequently .reported error in 
the speech production of the severely and profoundly hearing impaired is the- 
omission of a phoneme (Hudgins & Numbers, 1942; Markides, 1970; Smith", 1975). 
Omission of consonants may occur in the initial and/or final position of 
words, also reported as non-function of releasing or arresting consonants, 
respectively. 

Hudgins and Numbers reported that omission of initial consonants was more 
common than omission of final consonants. The consonants most frequently 
omitted from the initial position of words included /h, 1, r, y, th, s/. 
Turning to final consonants, the authors describe several error patterns: 
dropping of the consonant completely, releasing the consonants into the 
following syllable, or incomplete production whereby the phoneme loses its 
dynamic properties and becomes merely a passive gesture. Among the final 

consonants most frequently omitted in the study by Jf^J"- ^*2JJJHJ 
/l t, s, z, d, g, k/. These results are in agreement with those reported by 
Geffner (1980), who analyzed the spontaneous speech samples of young nearmg- 
impaired children. 

Others (Nooer, 1967; Markides, 1970; Smith, 1975) have also observed the 
omission of similar consonants from the speech of hearing-impaired ^jreri; 
In contrast to Hudgins and Numbers, however, .these studies repoi rteda greater 
number of consonants omitted from the final position of words than fromeither 
the initial or medial positions. 

e. Consonant-cluster errors. Not many investigators have reported data 
for production of consonant blends. This is surprising since Hud ^ 
Numbers suggested that these errors had ? .n important and deleterious ef fee on 
intelligibility. In their study, these errors involved two forms, one or 
more components of the cluster were dropped, or an adventitious Phoneme, 
usually the /d/. was added between the elements. This latter error may be 
particularly detrimental to the timing or rate and rhythm of speech. Brannon 
(1966) also found that misarticulation of consonant blends was a sign if.cant 
erro- in the speech of hearing-impaired children. Smith (1975) tested, 
consonant blends'/p. t. k/ and /s/ in the speech production of older hearing- 
impaired children (13-15 years old). Here again, there was frequent omission 
of one 6r more elements of the cluster. In fact, a phoneme in the blend 
environment was more likely to be- omitted than the same phoneme occurring in a 
nc- -blend environment. 

B. Acoustic Characteristics of Consonant Production 

We now turn to a discussion of ' the acoustic patterns of consonant 
producttcn. While these consonantal features have been much studied in normal 
and also in synthetic speech (cf. Borden & Harris, 1980; Pickett. 1980. for a 
Sview of t>his work), there ..have be^n far fewer studies of the acoustic 
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characteristics of consonants produced by hearing-impaired speakers. This is 
in part because spectral measurements of hearing-impaired speech are particu- 
larly difficult to make, either because of the mismatch between spectrograph 
filter and fundamental frequency (cf. Huggins, 1980) f or because of source 
function abnormalities. 

In normal speech production , the acoustic consequences of consonant 
production ar.e complex and spread over „a period of time. They involve 
differences in the sound source and the spectral composition ofr the signal. 
For example, in the production of a voiceless fricative in a vocalic 
environment (e.g., VCV, "I see"), the sound source is changed from a periodic 
to an aperiodic one, and then back to the periodic* source. Similarly, a 
voiceless aspirated stop in a similar VCV environment (e.g., "a pie") is 
associated with the fallowing sequence , of source changes: periodic voicing 
during the preceding vowel, silence during the consonantal *°osure, transient 
noise, aspiration noise, and periodic voicing during the vc 1. In addition 
to being spread across time, the acoustic attributes of man? consonants often 
involve short-term spectral changes, where high frequency components play an 
important role/ Examples of such attributes arS release bursts and formant 
transitions for stop consonants, .and. spectra and transition for fricatives. 
These characteristics provide' considerable information on the identity of 
segments. In the, speech of the hearing impaired, acoustic analysis of 
consonant production has been made only for voice-onset-time (VOT), formant. 
transition, or closure and constriction duration, and these patterns give 
ample evidence of the great perceptual difficulty that listeners to the speech 
of the hearing-impaired experience. 

1 . Voiced-Voiceless Distinction 

At the acoustic level, contrasts .such as "voiced" versus "voiceless" or 
"aspirated" versus "unaspirated" ere manifested as complexes of acoustic cues 
(Slis 4 Cohen, * 1969). In the classic study of Lisker and Abramson (1964), 
releas? of the oral occlusion relative to the onset of glottal pulsing (i.e., 
, voice-onset-time or VOT) was the saiient cue that distinguished voiced from 
voiceless stops. As was previously discussed, errors in voicing are common in ; 
the 'speech of the hearing impaired. Some acoustic studies of their speech 
provide evidence that a lack of voice-onset-time contributes to the perception 
of the voiced-voiceless confusion. 

Perhaps the most careful study in this area has been conducted by Monsen 
(1976b). Spectrographic measurements of VOT were ma.de of word-initial stops 
/p, t, k/ and /b, d, g/, produced by 36 profoundly hearing-impaired children. 
Some of the children distinguished the cognates in the normal manner. VOT 
values were longer for the voiceless than voiced segments and VOT contrasts 
were longer for velars than fcr alveolars and bilabials, respectively. 
However, most of the hearing- impaired speakers did not observe the voiced- 
voiceless distinction and deviated from normal speakers in a similar way. 
Typically, voitfe-onset-time values for voiceless segments were lower than 
those for voiced, and also overlapped with the measurements for voiced. This 
pattern was noted for /p-b/ and /t-d/, although measurements for /k-g/ were 
more complex. Furthermore, tliese subjects did not distinguish VOT among stops 
based on place of articulation- Hearing-impaired speakers who observed the 
voiced-vciceless distinction typically had high speech intelligibility, prob- 
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ably b>cause they were capable of producing other aspects of -speech normally 
as well. Hearing-impaired speakers who did not observe these contrasts tended 
to collapse the voiced-voiceless categories, producing most segments as 
voiced. These speakers were considerably less intelligible than those who 
produced the voicing distinction. 

Findings similar to\Monsen's' have been reported in .the earlier work of 
Calvert (1961) and Irvin and Wilson (1973), and more recently as part of 
measurements made in studying the acoustic and articulatory correlates of the 
speech of the hearing impaired (Mahshie, 1980; McGarr & Lofqvist, in press; 
Stein, 1980). In the McGarr and Lofqvist study, the authors noted that VOT 
values for some of their hearing-impaired speakers fell in the range of 20-30 
msec, which is close to the perceptual boundary where shifts in the perception 
x of voicing have been shown to occur. This may be one reason why listeners to 
the speech of the hearing impaired have difficulty making judgments of 
particular phonetic segments. We will return to these physiological studies 

prmant Patterns of Transition 




HeaVing-impaired speakers have often been described as having difficulty 
in movingX their articulators correctly from one phoneme to the next (Calvert, 
1961; John\& Howarth, 1965; Martony, 1966; Smith, 1975). One -manifestation of 
this problem at the acoustic level is distortion of formant frequency^ 
transitions. x 

Changes in the formant frequencies, particularly the direction, extent, 
and durat.ion of the second formant transition, have been shown to be important, 
acoustic cues for\ the place of articulation (Delattre. Liberman, & Cooper. 
1955; Liberman. Delattre. Gerstman. & Cooper. 1956). As discussed above, 
hearing-! mpaired speakers characteristically produce many errors involving the 
place of articulation. 

While there have been only a few acoustic analyses of formant transition, 
of hearing-impaired speakers, these studies are nonetheless in general agree- 
ment (Martony. 1966; Monsen. 1976c; Rothman. 1976). In general, this work 
showed that formants were exceedingly short in duration or missing altogether, 
that the extent of the frequency range of the transitions was limited in part 
> because the formant frequencies for vowels were greatly neutralized, and that 
transition* varied little with respect to phonetic context. Also, the slope 
of the transitions frequently remained' fairly flat when either a rising or 
falling pattern was dictated. Thus. F2 transitions in the speech of the 
hearings-impaired may be reduced in both duration and frequency range. Th ese 
patterns, together with deviations in the steady state formant frequencies for 
vowels (to be discussed later), suggest that hearing-impaired speakers have 
reduced articulatory movement and ah absence of the coa-ticuatory effects 
observed in the speech of normals. 
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C. Production of Vowels and Diphthongs 

1 * Overview 

Hudgins and Numbers (1942) were again among the first investigators to 
study the % production of vowels and diphthongs systematically in the speech of 
the hearing impaired. They classified the errors according to five major 
types: 

1. Substitution of one vowel for another 

2. Neutralization of vowels 
3* Diphthongization of vowels 

4. Nasalization of vowels 

5. ' Errors involving diphthongs: either the diphthong was split into two 

distinctive components 9 or the final member of the diphthong was 
dropped. * 

i 

In this study, substitutions and neutralization of vowels, and difficulty 
with the production of diphthongs were among the most coranon errors. 
Essentially the same pattern has been replicated in other studies ?f hearing- 
impaired speakers regardless of whether the vowel was produced in a CVC 
framework (Angelocci, Kopp, & Holbrook, 1364; Calvert, 1961 ) f in ;est words 
(Geffner, 1980; Mangan, 1961; Markides t 1970; Nober, 1967) t or in sentences 
(Smith, 1975). , . 9 ' ' 

There -is also agreement concerning the frequency of vowel versus conso- 
nant errors. Overall, fewer errors in vowel production have been reported, 
although it should be noted that this finding may be influenced by variables 
in both speaker production and listener perception. For example, Brannon 
(1966) claimed that vowels were in fact easier for heading-impaired speakers 
•to produce than consonants, since vowels were supposed to require less precise, 
articulatory position. Perceptually, Hudgins and Numbers (1942) and later 
Honsen (1976c) suggested^that listeners tole r a t e a - greater degree of distor- 
tion in vowels than in consonants, hence the report of fewer .vowel errors. 
Furthermore, acoustic information conveyed in the vocalic position of thep 
stimulus also provides .information of consonants, and thus, if erroneous (as 
we will discuss later), may directly affect the perception of the consonant. 
In general, it should also be noted that fewer vowels . than consonants are 
produced in running speech, thus there is less opportunity for error. 

2. Vowel Errors , ' 

Traditional classification schemes for' vowels employ such categories as 
tongue position (high-low,, front-uack)., tongue tension (tense-lax), and degree 
of lip rounding. These refer to arti« Mlatory events and are important to our 
subsequent discussion of the acoustic characteristics of vowels. In general, 
hearing-impaired speakers have been found to produce back vowels correctly 
more often than front vowels (Boone, 1966; Geffner, 1980; Mangan, 1961; Nober, 
1967; Smith, 1975) and low vowels correctly more often than those with raid or 
high tongue positions (Geffner, 1980;^ Nober, 1967; Smith, 1975) 4 In tact, 
Boone (1966) suggested that hearing-impaired speakers tend to keep their 
tongue retracted in a low back position. In contrast , Stein 1 s ( 1 980 ) 
cinefluographic study of vowels produced by hearing-impaired speakers showed 
"fronting" of back vowels. 
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. With respect to errors of substitution, hearing-impaired speakers often 
fail to make the tense-lax distinction (Mangan, 1961; Monsen, 197*** Smith, 
1975), although there is evidence to the contrary (Hudgins & Numbers, 19*»2; 
Markides, 1970)., The commonly observed error of neutralization, a problem 
akin to substitution, has been noted in the descriptive literature (Heider et 
al., 1941; Markides,. 1970; Smith, 1975) as well as in 'acoustic studies 
(Angelocci et, al., 1961; Monsen, 1976a, 1978"). This work suggests that the 
hearing-impaired speaker tends' 'to produce vowels with a pattern appropriate 
for the neutral vowel /*/. "DBs error has implications at the segmental as 
well as the suprasegmental lWel since, in the latter case, the syllable is 
shortened and often not given the appropriate stress. 

Other commonly, reported errors in vowel production include -inappropriate 
nasalization of vowels. (Martony, 1966; Stevens et al., 1976) and dipht'hongiza- 
tion of pure vowels (Boone, 1966; Markides, 1970; Smith, 1975). With th<r 
exception of Hudgins and Numbers (19t2), very little additional data have been 
collected on production of diphthongs, the error patterns reported being 
essentially the same (Levitt et al., 1980; Nober, 1967). 

D. Acoustic Characteristics of Vowels 

The acoustic characteristics of vowels and diphthong production have been 
studied in great detail in normal speech production, but^aga*n there is little 
i'n the speech of the hearing impaired. We will concentrate here primarily on 
studies of vowel formants, leaving a discussion of timing characteristics 
(i.e., duration) and segmenl^l influences on fundamental frequency until later 
in the chapter. 1 

The formant frequencies, especially the first (F,) and second*?"^ ) 
formants, are traditionally used to provide an acoustic description of-vowdls. 
Usually, -these formant values are ^.plotted against each .other ,_and_the__|at§._ 
points for each vowel cluster into fairly distinctive regions (cf. Peterson & 
Barney, 1952). Interestingly, the acoustic vowel plot of and F2 closely 
resemble the articulator vowel map. Although the relationship between, 
acoustic and articulator y correspondence is riot simple, it has .been suggested - 
that F v (which increases and then decreases as the vowels go from /i/ to- /u/) 
represents tongue height, and that F 2 (which decreases from /i/ to /u/; ^ 
represents the constriction of the tongue ir. the front-back plane. Of course, 
events such as degree of lip rounding, pharyngeal constriction, as ..well as 
individual speaker differences must also be considered. 

Analysis -of spectrograms of this population is not without problems. In 
many cases there are *kra harm6nics and ,&he fundamental frequency (Fo) of 
hearing-impaired speakers may often be quite high. This may create a "iVMtch 

•between the source and the bandwidth of the spectrogram . filter used in 
analyzing the acoustic signal, this- problem is similar to that faced in the 
spectrographs analysis of young hearing children's speech (cf. Huggins. 
ml) 1 Spectrograph^ analysis of the speech of the hearing impaired is 
further complicated by pertuuations in the source, inappropriate management of 

• intensity, and/or inappropriate nasalization that introduces zeros into the 
frequency domain. This often- precludes easy ana straightforward analysis. 
Some of these problems may be circumvented by the use of digital analysis 
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techniques such m Lroear Predictive Coding (LPC). Even with the use of LPf, 
determination of J^mnant frequency location may still be difficult. 

There have been several studies examining the -acoustic characteristics of 
vowels produced by hearing-impaired children using spectrographs aralysis^ 
(Angelocci' et al. t 1964; Monsen t 197^» 1978), and one study in which the 
speech was digitized and subjected to LPC analysis (Osberger, Levitt, & 
Slosberg, 1979)* Besides instrumentation , differences, ' these studies also 
differ in that the latter work includes only productions perceived as correct 
in hearing- impaired children 1 s speech, while the Other studies are not clear 
% with respect to this point. Nonetheless, the results of these studies show 
that the formant frequencies of deaf children 1 s vowels tend toward that of the 
neutral- vowel /d/. This result is of further interest since the hearing- 
• impaired subjects in both the Monsen and the Osberger et al. studies, produced ' - 
vowpls in sentence context, while subjects in Angelocci et al. produced vowels 
in CVC monosyllables. The data from these studies are interpreted 'to suggest 
that hearing-impaired speakers use a restricted amount of^ tongue movement to 
achieve vowel differentiation. Indeed, several investigators (Angelocci et 
al., 1964; Martony, 1968) have suggested that differences in vowels produced 
by hearing-impaired - speakers are achieved primarily by means of variation in 
fundamental frequency. 

^In addition to reduced phonological space for all vowels and extensive 
overlapping of* vowel areas, Monsen (1976a) also noted that the second formant 
of vowels remained around 1800 Hz rather than varying as different vowels were 
articulated. This "immobility of F 2 i. mt only deloteriously affects percep- 
tion of the vowel' but also interferes with transmission of consonantal 
information. The difficulty with F 2 is not surprising, since many hearing- 
impaired speakers have residual hearing only in the frequency rafige of F 1 and 

not in the range of F 2# Another factor is that the fror*--back tongue movement 
associated with second formants is impossible for a de«^ student to observe. i • 
Articulator^ movements such as jaw movement i associated with F-j C hahge are 
certainly -more visible. — — 

Very little is known about/the acoustic aspects of diphthong production * 
in the hearing impaired. Monsen (1976d), using spectrographs measurements of 
the diphthong /al/, has classified deviant acoustic patterns on the basis of 
frequency change- during- production of the diphthong. One deviant pattern is 
characterized by. a large change in the , frequency of F^ with an immobility cf 
F2. Monsen hypothesized that this pattern results when the appropriate jaw 
movement is not accompanied by appropriate movement of the tongue. Minimal 
movement of both F-| and F2 was, another pattern observed, which Monsen 
attributed to a generally stable vocal tract throughout production of the 
diphthong with minimal jaw movement. *: A third pattern was a Reversal of the 
direction of movement of F 2 with respect to normal. Monsen hypothesized this' 
to be the acoustic consequence of the diphthong being produced with the tongue 
lowered and retracted. 
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-IV. NON-SEG MENTAL PATTERNS IN THE SPEECH OF \ SEVERELY AND 
PROFOUNDLY HEARING-IMPAIRED SPEAKERS 

> + 

This section will present information on the non~ar?ticulatory aspects of 
hearing- impaired children 1 s speech. These patterns are also referred to as 
suprasegmental because they involve characteristics of speech, that extend over 
units composed of more than one phonetic segment. Included in this category 
$re characteristics such as timing, intonation, and stress assignments. These 
areas, as well as the acoustic correlate of pitch (fundamental frequency) and 
factors affecting perceived voice quality will be described in this section. 

A. Timing Patterns 

1 . Overall Speaking Rate 

With few exceptions, the, speech of the severely and profoundly hearing 
impaired is perceived as being too slow and sounding very labored. Physical 
measures of speaking rate haye shown that profoundly hearing-impaired speak- 
ers, on the average, take 1.5 to 2.0 times longer to produce the - same 
utterance as do normal-hearing; speakers (Boone, 1966; Heidinger, 1972; Hood, 
1966; John & Howarth, 1965; Voelker -1935, 1938). The reduced speaking rate 
is due to the excessive prolongation of speech segments and the insertion of 
pauses. „ • ' 

Prolongation of speech segments may be present in the production of 
photiemes, syllables, and words. Calvert (1961) was among the first to obtain 
objective measurements of phonemic duration in the speech of the. hearing 
impaired by spectrographic analysis of bisyllabic words. The results of this 
study showed that hearing- impaired speakers extended the duration of vowels, 
fricatives, and the closure period of plosives up to five times the average 
duration fdr normal speakers. In a later study, Osberger and Levitt (1979) 
observed that syllable prolongation in the speech of the hearing impaired was 
due primarily to prolongation of vowels. 

Figure I shows data obtained by Osberger' (1978) on mean syllable duration 
in a sentence produced by six normally-hearing and six profoundly hearing- 
impaired children. The data show a distinctive pattern of syllable durations 
for the two groups of speakers. The line connecting the data points of the 
hearing-impaired speakers lies above and is approximately parallel to that of 
the hearing children. The exception to this is the sixth syllable where the 
mean syllable duration is shorter for the hearing- impaired than the normal 
speakers. This was due to the omission of some of the phonemes in the 
syllable by the hearing-impaired speakers, making the duration of the syllable 
shorter than would be expected if all of the intended segments had been 
produced. The size of the standard deviations, shown by the vertical bars, 
indicates that there is greater variability in syllable duration among the 
.hearing-impaired than among the normal speakers. 

Profoundly hearing-impaired speakers typically insert more pauses, and 
pauses of longer duration, than do speakers with normal hearing (Boone, 1966; 
Boothroyd, Nickerson, & Stevens, 1974; Heidinger, 1972; Hood, 1966; John & 
Howarth, 1965; Stevens, Nickerson, & Rollins, 1978). Pauses may be inserted 
at syntactically inappropriate boundaries such as between two syllables in a 
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Figure 1. Mean duration (msec) for syllables in the sentence "I wish I could 
read that book" produced by six normal-hearing children and six 
hearing- impaired children. The standard deviation is represented 
by the vertical bars (after Osberger f 1978). 
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bisyllabie word or within phrases. The greatest difference between normal and 
hearing-impaired speakers has been observed in the durations -of inter- and 
intraphr3se pauses (Stevens et al., 1978). The results of Hudgins (193*», 
1937. 1 916) suggested that the frequent pauses observed in the speech of the 
hearing impaired may be the result of poor respiratory contrcl. Specifically, 
Hudgins found that deaf children used short, irregular breath groups often 
with only one or two words, and breath pauses that interrupted the flow of 
speech at inappropriate places. In addition,- there was excessive expenditure 
of breath on single syllables, false groupings of syllables, and misplacements 
of accents. Later, we shall discuss the propensity of hearing-impaired 
speakers to use inappropriate breath groups. 

2. Segmental Timing Effect s 

Acoustic analyses of normal speech have shown that the duration of vowels 
is systematically influenced by effects operating at the level of phonetic 
segments. . Since vowels form the nuclei of the larger segments of speech, 
these differences in vowel duration exert substantial effects on both the 
production and perception of the temporal and segmental aspects of speech. 
Vowels have been described as having an intrinsic duration (Peterson & 
Lehiste, 1960) and, in comparable contexts, some vowels are consistently 
shorter than other vowels (House, 1961). Hearing-impaired speakers with 
severe and profout.u losses have been found to distort this relationship 
between the vowels. For example, Monsen (197 1 *) observed that /i/ was 
relatively longer than s /I/ in monosyllabic words in the speech of normal- 
hearing subjects, but irts the speech of profoundly hearing-impaired children, 
there was a tendency for these vowels to occupy mutually exclusive duration 
ranges. McGarr and Harris (1980), on the other hand, found that the 
profoundly hearing-impaired speaker in their study did not show consistent 
differences in intrinsic vowel duration. 

There is substantial literature showing that the average duration of 
vowels also varies markedly as a function of phonetic context in normal 
speech. When different phonetic contexts are considered, the voicing charac- 
teristic of the following consonant has been shown to have^ a consistent effect 
on preceding vowel duration; for normal speakers, the duration of a vowel 
preceding a voiceless consonant is less than the vowel duration preceding a 
voiced consonant in stressed, syllables (Denes, 1955; House\J961; House & 
Fairbanks 1953; Peterson 4 Lehiste, 1960). This systematic change in vowel 
duration has been found to be a significant perceptual cue to Hhe voicing 
characteristic of the following consonant or consonant cluster ^(Raphael , 
1972) Results obtained by Calvert (1961) and Monsen (197 1 *) have shown that 
the hearing impaired fail to produce the appropriate modifications m^vowel 
duration as a function of the voicing characteristics of the following 
consonant. Thus, the frequent voiced-voiceless confusions observed in the" 
speech of the deaf may actually be due to vowel duration errors (Calvert, 
1961). 

3. Suurasegmental Timing Effects 

Th.; duration of segments is also influenced by effects operating at the 
level of syllables, words, and phrases. In English, changes in contrastive 
stress have been found to produce systematic changes in vowel duration. When 
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vowels are stressed, they are longer in duration than when the same vowels are 
unstressed (Parraenter & Trevino, 1936). This durational variation has also 
been found to be an important cue for the perception of stress (Fry f 1955 f 
1958). 

Several investigations have* shown that while hearing-impaired speakers 
make the duration of unstressed syllables shorter than that of the stressed 
syllables, the proportional shortening is smaller, on the average, in the 
speech of the heading impaired than in the speech of normal subjects' (Osberger 
4 Levitt, 1979; Stevens et al. t '1978). In contrast to this, Reilly " (1979) 
found larger than normal duration differences between vowels in primary- and 
weak-stress syllables produced by a group of profoundly hearing-impaired 
children. These, data are shown in Figure 2. In this figure, duration has 
been calculated for the vowels /i, I, u/ produced in both primary- and weak- 
stress syllables by hearing and hearing-impaired children. For /if and /u/, 
longer average durations ■ were measured for greater stress for both grpups, 
with the hearing impaired durations being longer overall, and the difference 
between the primary and, weak syllables being more extreme than in the samples 
produced by the hearing children. There was almost no difference in duration 
between the primary and weak /I/ in the normal children's samples, whereas the 
hearing-impaired speakers produced longer durations of /I/ in weak syllables 
than primary stress syllables. 

Exactly how a hearing-impaired speaker uses terciporal manipulations to 
convey differences in syllabic stress pattern is not clear. In a recent 
study, McGarr and Harris (1580) found that even though intended stressed 
vowels were always longer than unstressed vowels in the speech of one 
profoundly hearing-impaired speaker, the intended stress pattern was not 
always perceived correctly by a listener. Thus, the hearing-impaired speaker 
was using some other suprasegmental feature to convey contrastive stress. 
Variation in fundamental frequency would .be a likely alternative, but McGarr 
.and Harris also found that while the hearing-impaired speaker produced the 
systematic changes in fundamental frequency associated with syllable stress, 
perceptual confusions involving stress pattern wer still observed. 

Another suprasegmental temporal effect occurring in normal speech is 
prepausal lengthening. When a syllable occurs before a pause that marks a 
major syntactic boundary, it is longer in duration than when< it occurs in 
other positions in a phrase (Klatt, 1975). It has been observed that hearing- 
impaired speakers do not always lengthen the duration of phrase-final syll- 
ables, relative to the duration of the other syllables in the phrase. Stevens 
et *al. (1978) observed that when there was evidence of prepausal lengthening 
in the speech of profoundly hearing-impaired talkers, the increase in the 
duration of the final syllable was much smaller, on the average, for the 
hearing-impaired than for the normal-hearing speakers. In contrast to this, 
Reilly (1979) found that the profoundly hearing-impaired speakers in her study 
used duration to differentiate prepausal and non-prepausal syllables. As was 
the case for primary- and weak-stress syllables, discussed above, Reilly 
observed a larger than normal difference between the duration of syllables in 
the prepausal and non-prepausal position in the samples produced by the 
hearing-impaired children. 
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Figure 2. Mean ',owel duration (msec) in primary- and weak-stress syllables 
produced by a group of normal-hearing and a group of profoundly 
hearing- impaired children (after Reilly, 1979). 
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The information presented above, clearly shows that profoundly hearing- 
impaired speakers distort many temporal aspects of speech. These .distortions, 
such as excessively prolonged speech segments, and the insertion • of both 
frequent and lengthy pauses, are perceptually prominent and disrupt the 
rhythmic aspects of speech. In spite of these deviancies, there is evidence 
suggesting that hearing-impaired talkers manipulate some aspects of duration, 
such a$ those involving relative duration, in a manner similar to that of 
speakers with normal hearing. 

B. Fundamental Frequency Patterns 

1 . Average Fundamental Frequency 

Among the most notideable speech disorders of the hearing impaired are 
those involving fundamental frequency (Fo). In normal speech,*, there are 
differences in average fundamental frequency depending on the sex and age of 
the speaker. Reported- fundamental frequency values range from 100-175 Hz for 
adult males and from '175-250 Hz for adult females ^Fairbanks, 1940; Fairbanks, 
Wiley, & Lassman, 1949; Fairbanks, Herbert, & Hammond, 1949; Hollien & Paul, 
1969); Recent data (Hasek, Singh, & Murry, 1981) suggest that a significant 
difference between the average Fo of preadolescent male and female children 
with normal hearing begins to emerge by seven or eight years of age, with the 
sex difference attributable to a reduction in Fo for male children only, 
beginning around age seven. No significant preadolescent age-related change 
in Fo in females was observed. 

If there is a problem with a hearing-impaired speaker f s average fundamen- 
tal frequency, more often the voice pitch is characterized as too high rather 
than too low (Angelocci et al., 1964; Boone, 1966; -Martony, 1968). Some 
differences in average Fo have been found as a> function of the age or sex of 
the hearing-impaired speaker. The results of several studies have shown that 
there are no significant differences in average Fo between young hearing and 
hearing-impaired children in the 6-12 year age .range XBoone; 1966; Green, 
1956; Monsen, 1979). Differences have been reported between groups of older 
children but it is not clear if pitch deviation is greater for hearing- 
impaired females or males. Boone (1966) found^ a higher average Fo for 17-18 
year old males than females. Osberger (1981 ) found that the difference in Fo 
between hearing and hearing-impaired speakers in the 13-15 year age range was 
greater for females than for males. , This finding' is illustrated in Figure 3» 
which shows the Fo values averaged a.croks sentences for six normal-hearing and 
ten hearing-impaired subjects. As can be seen, ^the Fo for the female hearing- 
impaired speakers ranged between 250-300 Hz. This value is about 75 Hz higher 
than that observed for the normal-hearing females,. The average Fo value of 
the utterances of the male hearing-impaired speakers is slightly lower than 
that of the hearing males for the first part of the utterance. The Fo .values 
for the hearing and hearing-impaired male speakers overlap for the last half 
of the utterance. Bush (1981) observed excessive segmental variations in Fo 
for a small group of profoundly hearing-impaired females in the same age range 
as those in the Osberger study. Age-related factors such as laryngeal growth 
accompanied by adolescent voice change or similarities in speech training were 
suggested by Bush as reasons for the problems of the- females in controlling 
Fo. 
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Figure 3. Fundamental frequency values in Hz, measured at the center of the 
vowel in *ach syllable in the sentence "I like happy movies better 
for groups of normal-hearing and profoundly hearing- impaired males 
and females. 
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Up to this point, we have limited our discussion to physical measures of 
fundamental frequency. In a clinical or schopl situation, the examiner will 
not, in most cases, have the equipment necessary to make ,sucfr measurements* 
In these settings, the clinician will have to rely on his or tyer perceptual 
abilities to evaluate the appropriateness' of the child's pitch. The pitch 
deviancy of profoundly hearing-impaired children has been evaluated perceptu- 
ally by McGarr and Osberger (1978), using a five-point rating scale. The 
profile rating of pitch register (Subtelriy, 1975) and the descriptors are 
shown in Table 1, Tfte scale was used with approximately 50 children 10-11 
years of age. The results of this study showed that, a large number of the 
children received pitch ratings that were either appropriate for their age and 
sex or differed only slightly from optimal level. Thirty-two of the children 
received an average rating higher than 4.0. There ,was, however, a Small group 
of children who could not sustain phonation and whose speech was characterized 
by pitch breaks or large fluctuations in. pitch. On the whole, these findings 
are - in agreement with earlier studies indicating that the pitch o ' many 
preadolescent hearing-impaired children is within the normal range. It is not 
clear to what extent the average Fo of a hearing-impaired child 1 s. speech can 
differ from tha£ of a normal child before it is perceived as deviant and 
hence, remedial training is indicated. 



Table 1 

Sating Scale Used to Evaluate Pitch (from Subtelny, 1975) 



Profile 

R ating Functional Descriptor 



1 . Cannot sustain phonation 

2. Much above (+) or much below (-) optimal level 

3. Moderately above (+) or below (-) optimal level 

4. Slightly above (+) or below (-) optimal level 

5. Appropriate for age and sex 
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2. Intonation Patterns 



Intonation is the perceived pattern of change in fundamental frequency 
within a phrase . or sentence. Reference is made, even in the very early 
literature, to the difficulties that hearing-impaired speakers experience in 
controlling this aspect of speech. Haycock (1933), Rawlings (1935), Russell 
(1929), Scripture (1913), and Story (1917) all describe the speech of 
congeriitally deaf persons as "monotonous" and "devoid of melody." Later 
investigations showed that hearing-impaired speakers did produce pitch varia- 
tions, but the average range was more reduced than those of speakers with 
normal hearing (Green, 1956; Hood, 1966; Voelker, 1935). 

- Some hearing-impaired speakers may' demonstrate an intonation problem in 
the form ~~of excessive and inappropriate changes in fundamental frequency. 
These speakers may raise or lower Fo 100 Hz or more within the same utterance. 
Often, after a sharp rise in fundamental frequency, the hearing-impaired 
'speaker loses all - phonatory control, and there is a complete cessation of 
phonation (Monsen, 1979; Smith, 1975; Stevens et al.» 1978). 

Figure 4 shows the intonation contour of a simple, declarative sentence 
spoken by a normal, 14-year-old female. There is a rise in Fo at the 
beginning of the sentence with a peak on the first stressed syllable (the 
second ^ syllable in the sentence). As the sentence is produced, there is a 
gradual reduction in Fo, known as declination. The sharp drop that occurs in 
Fo at the end of the sentence is referred to as the terminal fall. Figure 5 
shows the contour of tne same sentence spoken" by a hearing-impaired male 
speaker, 14 years of age, judged to have insufficient variation in intonation. 
Note that the extent of the change in the Fo throughout the utterance is more 
restricted than that observed for the child with normal speech in Figure ,4. 
In contrast to this pattern, Figure 6 shows contours for two females, 14 years 
old, who- produced the sentence with excessive and inappropriate changes in fro. 
Speaker 1 produced the first part of the sentence with a sharp rise in Fo, 
followed by a sharp fall in Fo over the last half of the utterance. Speaker 2 
produced inappropriate fluctuations in Fo throughout the entire utterance. 

There have been few' attempts to arrive at a quantitative classification 
of intonation contours produced by hearing-impaired children. Monsen (1979) 
has described the following four types of contours that he found to occur in 
the production of CV syllables by 3- to 6-year-old hearing-impaired children: 
(1) a falling contour, characterized by a smooth decline in Fo at an average 
rate greater than 10 Hz per 100 msec; (2) a short-falling contour, occurring 
on words of short duration— the Fo fall may be more than 10 Hz per 100 msec 
but the total change may be small; (3) a falling-flat contour, characterized 
by a rapid change in frequency at the beginning of a word, followed by a 
"relatively unchanging, flat portion; (4) a changing contour, characterized by 
a change in frequency, the duration of which appears uncontrolled, and extends 
over relatively large segments. 

Monsen (1979) found that the type of contour appeared to be an important 
characteristic in separating the better from the poorer hearing-impaired 
speakers. His classification scheme represents a substantial step forward in 
describlrg the intonation patterns of the hearing impaired. It remains to be 
determined if such a classification scheme can be used to describe objectively 
the intonation patterns of entire sentences as well as isolated syllables. 
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Figure 4. The intonation contour of the simple declarative sentence, "I like 
happy movies better t " spoken by a normal-hearing child. Each data 
point is the fundamental frequency value in Hz', measured at the 
center of the vowel in each syllable in the sentence. 
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Figure 5. The intonation contour of the sentence "I like happy movies better" 
• spoken by a profoundly hearing-impaired speaker judged to produce 
insufficient v?riation in intonation. Each data point is the 
fundamental frequency value in Hz, measured at the center of the 
vowel, in each syllable of the sentence. 
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Figure 6. The intonation contour of the sentence n l like happy movies better" 
spoken by two profoundly hearing- impaired females judged to prod dee 
excessive and inappropriate changes in fundamental frequency. Each 
data point is the fundamental frequency value in Hz, measured at 
the center of the vowel in each syllable of the sentence. 
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• One factor that -strongly influences Fo changes is the degree of stress 
placed on syllables within a breath group. Typically, stressed syllables are 
spoken with a higher fundamental frequency than are unstressed syllables (Fry, 
1955). Thus, the contour consist's' of peaks (rises) -and valleys (falls) in Fo 
that correspond to the stressed and unstressed syllable pattern of. the 
sentence. This pattern has been observed to be distorted in the speech of the 
hearing impaired. An example of this distortion 4 is apparent, in the Fo 
contours of the two speakers in Figure 6. 

3. Segmenta l Influences on Fundamental - Frequency Control . 

A common clinical observation is that some hearing-impaired children 
produce the vowels /i, I, u/ on a higher Fo than the other vowels of English. 
It has been sho"6n that there is a systematic relationship between vowel and Fo 
in normal speech. High vowels are produced on a higher Fo^ than^low vowelS-, . 
resulting in an inverse relationship between Fo .and the frequency location of ■ 
the first formant of the vowel (House & Fairbanks, 1953; Peterson & Barney, 
1952) Angelocci et al. (1964) first examined some of the vowel changes in Fo 
in the speech of the hearing impaired. Their results showed that the average 
Fo and amplitude for all vowels was considerably higher for the hearing- 
impaired than for the normal subjects. In contrast, the .range of 'frequency 
and amplitude values for the vowel formants was greater for the' normal-hearing 
than for the hearing-impaired speakers. This finding, combined with the, high 
Fo and large amplitude values, led Angelocci et al. to suggest that the 
hearing-impaired' subjects attempted to differentiate vowels by excessiye 
laryngeal variation rather than with articulatory maneuvers as do normal- 
hearing speakers. 

A rec^Tt study by Bush (1981 ) does not support a simple trade-off between 
Fo variability and articulatory skill. Bush observed 'a close relationship 
between vowel-related variability in, Fo - and articulatory skill for the 
majority of profoundly hearing-impaired subjects- in her study. In general, 
greater Fo variability was observtd for the hearing-impaired speakers who 
produced a wide range of vowel sounds (in terms of F-| a nd-F2 values) and who 
were more intelligible than speakers whose articulator* skills were more 
limited. Bush also noted that although the amount of Fo variation with vowels 
used by the hearing-impaired speakers was- greater, on the average, tnan that 
used by the hearing speakers, the direction in which Fo varied as a, function 
of vowel height was similar for the two groups of speakers. 

On the basis of these observations, Bush' concluded that the vowel-to- 
vowel variations produced ' by the hearing-impaired speakers were, in some way, 
a consequence of the same articulatory maneuver used by normal speakers in 
vowel production. These data are discussed in terms of Honda -s (Note , 1) 
account of vowel-related variations in Fo for normal speakers. Briefly, 
Honda's mechanism assumed that moving the tongue root forward for the 
production of high vowels causes the thyroid bone to move forward, tilting .he 
cartilage' anteriorly. As a result of these maneuvers, there is 
-tension on the vocal folds, resulting in an increase in Fo. \u s h Jas 
postulated that because of - the non-linear nature of the stress-strain rela 
lionship for vocal-fold tissue, increases in vocal r fold tension ^ma y be gtjter 
in magnitude when the tension on the vocal folds is already relatively high 
(as is the case with hearing-impaired speakers), resulting in somewhat larger 
increases in Fo during the articulation of high vowels. '25? 
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In summary, as was^ observed for some of the temporal patterns of speech, 
it appears that' profoundly hearing- impaired speakers encode and organize some 
aspects of fundamental frequency with respect to syntactic considerations in 
much the. same manner as do normal speakers. There are obvious deviancies in 
fundamental frequency control in the sp'eech of the hearing impaired, but. there 
is, evidence to suggest that they know and use some 'of the same rules applied 
by normal-hearing speakers. 

C. Production Patterns Affecting Voice Qjality' 
1. Voice Quality 

; \ 
It is not unusual, to find people who, after working with the . pro found lj 

hearing impaired, claiip that the speech of. this population has a distinctive 
qualiry that differentiates it from other speakers. 'Calvtert (J96D* found, that 
teachers of the hearing impaired could reliably differentiate 7 the voices of '« 
profoundly hearing- impaired speakers from normal spfeak^P^,* provided the speech 
, sample's contained articulatory movement, such, as that required for the 
* production of a diphth&ng or ar CVC syllable. Productions with negligible 
articulatory movements, such as sustained vowels, .failed to provide* the 
experienced listeners with the necessary information for the correct identifi- 
cation of speakers. On the basis of these findings, Calvert concluded" that 
the distinguishing characteristics of the -speech of tne profoundly hearing 
impaired are associated with articulatory movement over time, rather th3n with 
voice quality per se. * 

In the same study, Calvert (1961) also found that there was a great deal 
of variability among teachers in choosing the 4 characteristics .they .felt 
described most closely the voice quality of the hearing impaired. Thus, 
although the deviant voice quality of the hearing impaired can tje recognized 
easily the characteristics that contribute to the perceived deviation are 
difficult to characterize. 

In a recent study, Monsen (1979) quantified some of these characteris- 
tics. Acoustic analysis of duration, fundamental frequercy, and^ phonatory 
control were correlated with ratings ot voice quality foY monosyllables 
produced by young hearing- impaired children. The results of this study showed 
that the fundanental frequency contour -appeared to be the most general 
acoustic characteristic differentiating the children with gejtter voices from 
those with poorer voices. Children with good voice-qual'ity ratings had 
, fundamental frequency contours that fell within an appropriate range and that 
varied over time in an appropriate manner. In contrast, children with poor 
voice quality produced intonation contours that ,were* excessively flat or 
excessively changing. Monsen (1979) concluded that while other deviations 
3uch as poor vowel quality, breathiness , and duration errors may exert a 
strong influence on perceived voice quality in individual cases, th^y do not 
appear to be the major factors in determining the quality of the voice. From, 
the results of this study and those oT Calvert (1961), it appears that the 
distinctive voice quality of the hearing .impaired may be due to both poor 
articulatory timing control and inadequate control of source function. 
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2. Nasalization 



Proper control of the velopharynx has been recognized as a source of 
difficulty for hearing-impaired speakers for many years (Hudgins, 1934). . If 
the velopharyngeal' port is opened when it should be closed, the speech may be 
perceived as hypernasal; if it is closed when it should be opened, hyponasali- 
ty will result. Problems in nasalization control are often described as 
affecting voice quality because hyper- or hyponasality affects the resonant 
properties of speech. Improper velopharyngeal control • may also result in 
articulatory errors, a problem addressed edrlier in this chapter. 

Tn"a cLinical setting, the evaluation - of velopharyngeal control is 
usually made on the basis of qualitative judgments, which are often difficult 
to assess because they may be influenced by the presence of other deviancies. 
Stevens et? al. (1976) have attempted to overcome this problem by developing a 
procedure to quantify the degree' of nasalization for nasal and non-nasal 
sounds in the speech of hearing-impaired children. Measurements qf nasaliza- 
tion have been obtained with an accelerometer attached to the surface of the 
nose. The accelerometer picks up vibrations of the nose when there is 
velopharyngeal opening during a voiced sound. Stevens et al. have evaluated 
adequacy of velar control by comparing the amplitude of the accelerometer 
signal (in decibels) for nasal consonants to the amplitude of vowe^ sounds 
that should be produced without nasalization. x For normal-hearing speakers, 
the amplitude difference between these measures Is in the range of 10-20 dB. 
Using amplitude difference as an index of nasalization, Stevens et al. found 
that 76* of the profoundly hearing-impaired children studied had excessive 
nasalization in at least half of the vowels produced in monosyllabic words. 
Excessive nasalization on at least 8 of the 10 vowels studied was observed for 
36% of the children. The greatest difficulty in velopharyngeal control was 
evidenced in the hearing-impaired children's production of nasal-stop clus- 
ters which required closely coordinated movements of the velopharynx and oral 
articulators. Almost half of the hearing-impaired children made an error on 
at least one word with a nasal-stop cluster. 

3. Breathy Voice and Glottalization 

These problems are caused by improper adjustment of the vocal folds. 
Breathiness occurs when there is excessive airflow during voicing, resulting 
in generation of turbrlence noise a t the glottis. The vocal folds do come 
together rapidly, which affects the shape of the volume-velocity waveform, 
resulting in an acoustic waveform with enhanced energy in the low frequencies 
and deficient energy in the high frequencies (Stevens et al., 1978). 

Glottalization involves the insertion of the glottal stop between syll- 
ables or words. It is caused by tightly adducting the glottal folds ^ then 
abruptly releasing them. Profoundly hearing-impaired children often substi-. 
tute glottal stops for consonants produced in the center and back of the mouth 
(Levitt <«t al 1976). There is a tendency for hearing-impaired children who 
inserJmany giottalizations in their speech to have lower intelligibility than 
those who do not (Stevens et al., 1978). 
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V. PRODUCTION PATTERNS IN THE SPEECH OF HARD-OF-HEARING CHILDREN 



A. Articulatory Patterns 

Until only recently, ^ittle attention has been paid to the speech of the 
hard-of-hearing child. This is probably largely due to the fact that the 
majority of these children are integrated into regular schools and they are 
not as accessible for- utudy -as the students attending day schools fo? the 
deaf. In addition, researchers traditionally have viewed the communication 
and education problems of the profoundly hearing impaired as more serious than 
those of the hard -of hearing and, thus, the majority of research effort has 
been devoted to the children who appeared to have the greatest need. We now 
know that the presence of even a mild hearing loss can affect speech and 
language development and interfere with academic performance. Often, hard-of- 
hearing children are neglected in the public school system. They frequently 
fail to receive the support services from appropriately trained professionals 
that they require in order to perform successfully in a regular class- (Davis, 
1977). 

The majority of information available on the speech of hard-of-*hearing 
children involves' analyses of articulatory skills. Relatively few studies 
have quantified suprasegmental production patterns and, for this reason, only 
the segmental aspects of the speech of Ihar.d^af-hearing children will be 
discussed. j 

i 

If it is assumed that the major difference between hard-of-hearing and 
profoundly hearing-impaired children is the! degree of hearing loss, it is to 
be expected that hard-of-hearing children would have better speech skills than 
children with profound Maring losses. iThis notion has, in fact, been 
supported by the results of several studies showing that, on the average, 
there is a lower frequency of vowel and consonant errors in the speech of 
hard-of-hearing .children than in the speech of profoundly hearing-impaired 
children (Gold, 1978; Hudgins 4 Numbers, 19U2^ Markides, 1970; Nober, T967). 

Probably the most comprehensive study on\ the speech of hard-of-hearing 
children h^as been completed by Gold (1978). rn this study, the articulatory 
errors made by mainstreamed heard-of-hearing (p\ire Tone Average of 80 dB HTL 
or less) * and deaf (PTA of 80 dB HTL or greater) children were compared. 
Phonemic i transcriptions we^e made of sentences\ read by the children that 
contained j all the phonemes of English. The data v^ere analyzed to determine if 
_the types ^of articulatory errors were the same for\the two groups of children. 
The results in terms of overall error rate revealed, not unexpectedly, that 
the' deaf group had significantly more segmental errors than the hard-of- 
hearing group. The data further revealed that the types of errors were 
similar for the two groups of children. These data are summarized in Table 2. 
Two calculations were made f'or each of the eight eAror types for both groups 
of children. The first calculation, error type as the proportion of intended 
phonemes for each of the two groups (shown in the f^rst column of Table 2), 
was derived from the frequency of the error type relative tp the total number 
of phonemes Tn the sample. The second calculation, error type as a proportion 
of all of the errors (shown in the second column), was \ performed to take into 
account the higher error rat(e of the deaf group. T^us, the proportion of 
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Relative Frequency of Articulatory Errors for Hard 
Hearing and Deaf Children (from Gold, 1978) . 
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errors was based on the relative frequency of the error type out of the total 
proportion of errors made by the group. Once the overall rate was taken into 
account, the data showed striking similarities in the frequency of an error 
type for the hard-of-hearing and deaf children. For example, the most 
frequent type of error for both groups was that of omission. As the data 
show, differences between the two groups were not substantial. 

The results of the Gold study showed that although profoundly hearing- 
impaired children produce more segmental errors than hard-of-hearing children, 
the relative proportion of errors for each error type is similar for both 
groups of children. Only a small number of phonemes showed any significant 
differences in the pattern of confusions between groups. Gold has concluded, 
at least for children in the same type of educational setting, that the degree 
of the hearing loss is more strongly related to the overall frequency of 
errors than to the kinds of errors that will be made. 

B. Pattern of Speech Errors of Different Populations of Children 

From the preceding discussion, it becomes evident that the pattern of 
articulatory errors is remarkably similar in the speech of different 
populations of hearing- impaired children. Two studies, those of Snith (1975) 
and Gold (1978), lend themselves to cross-population comparison because the 
same test materials and procedures were used by the two investigators. The 
major difference between the studies is the groups of children studied: Smith 
examined the segmental errors in the speech of profoundly hearing- impaired 
children in an oral day school for the deaf; Gold, as mentioned above, 
examined the segmental errors in mainstreamed hard-of-hearing and profoundly 
hearing- impaired children. Some of the data from these two studies have been 
plotted in Figure 7. In this figure, correct production of consonants is 
plotted as a function of place of production for the three groups of hearing- 
impaired children. Comparison of the data shows distinct patterns across 
groups of children. As might be expected, the hard-of-hearing children most 
often produced the consonants correctly, followed by the mainstreamed 
profoundly hearing-impaired children; the children in the school for the deaf 
produced the consonants correctly the least often. Note also that sounds 
produced in the front of the mouth were most often correct, followed by the 
back sounds; sounds produced in the middle of €he mouth were most prone to 
error, a finding discussed earlier in this chapter, 

V 

Gold did find some significant differences in the pattern of confusions 
made by the mainstreamed hearing- impaired children and the children in the 
school for the deaf. The children in the school for the deaf used more 
neutral vowel substitutions and omitted more consonants than did the 
mainstreamed children. They also substituted the glottal stop for /t/ and 
/k/, and /b/ for labial sounds more often than the profoundly hearing- impiared 
children who were mainstreamed. 

The results cf Gold's (1978) study show that although the nature of the 
confusions did not differ significantly between the hard-of-hearing and deaf 
children in the same educational setting, there were significant differences 
between the deaf children in schools for the deaf and those in the regular 
public schools. Similarities in segmental error pattern were also apparent 
across jroups of children. It should be mentioned that although the 
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Percent correct production of consonants plotted as a function of 
place, of production for three 'groups of hearing-impaired children 
(after Staith, 1975 and Golciy 1978). 
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mainstreamed children had better speech skills than the children in the school 
for the deaf f a causal relationship between speech skills and school setting 
cannot be concluded. 'Although it is possible that a hearing-impiared child's 
— speech may improve as a result of daily exposure to hearing children, the 
children in Gold's study may have been mainstreamed because of their good 
speech skills. 

1 

VI. MECHANISMS OF PRODUCTION CONTROL 

As we have described above, speech production skills of the hearing 
impaired have been examined using listener judgments, phonetic transcriptions, 
and acoustic analyses. While the descriptive literature is fairly detailed, 
there have been few physiological studies on the speech of the hearing 
impaired. This is surprising since technology is available and also because 
speech production skills in normals have been studied fairly extensively. 
Indeed, close to 50 years ago, researchers attempted objective measurements of 
hearing-impaired speech production in such areas as breath control (Hudgins, 
1936; Rawlings, 1935; Scuri, 1935), voice production (Hudgins, 1937; Voelker, 
1938), and articulation (Brehm, 1922; Hudgins, 193*0. Although by today's 
technological standards, the instrumentation in these studies was not very 
sophisticated, these researchers deserve our admiration for their ingenuity 
and creative insight. Their intuition and observations ar£ clearly not dated. 
Consider the following: 

"The most obvious fault in the speech-breathing of deaf children is 
that they have little or no control over the breath supply so that a 
great deal more breath than is necessary is allowed to escape on 
each syllable. They do not speak with normal chest-abdominal 
r action. They have not learned to group their syllables into breath 
groups and phrases. Instead, they often expend an entire breath on 
a single word. The reasons for this excessive use of breath is two- 
fold: The inco-ordinated (sic) movements of the breathing muscles 
and the maladjusted glottis" (Hudgins, 1937, p. 3^8). 

The observations of Hudgins and his contemporaries might be taken today 
as evidence for a breakdown in interarticulator coordination. That is, 
hearing-impaired speakers fail to coordinate the complex activity of respira- 
tion, phonation, and articulation, and the resultant errors in timing occur at 
the segmental and suprasegmental levels of speech production. 

Admittedly, there has been a long hiatus between the early research' 
efforts and contemporary rekindled interest in speech physiology of the deaf/ 
Whether the time lapse represents a period of preoccupied interest with that 
of describing the error patterns of hearing-impaired talkers, or reflects a 
lag in applying the technology and ideas of speech production in normals to 
speech production of the hearing impaired can only be conjecture. There may 
be some truth in each, but in any event, we now turn to some recent studies on 
the physioldfeical characteristics of deaf speech. 
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A. Respiration 

Studies on the respiratory patterns of profoundly hearing-impaired speak- 
ers have shown that they evidence at least two kinds of problems. The first 
is that tuey initiate phonation at too low a level of vital capacity, and also 
that they produce a reduced number of syllables per breath (Forner & Hixon, 
1977; Whitehead, in press). The second problem is that they mismanage the 
volume of air by inappropriate valv'ing at the laryngeal level. 

Recently, Hixon and his associates have provided some objective data on 
respiratory behavior both in normal (Hixon, Goldman, 4 Mead, 1973*. Hixon, 
Mead, & Goldman, 1976) and also in , hearing-impaired speakers (Forner & Hixon, 
1977). In these studies, magnetometers were used to measure changes in the 
anterior-posterior dimension* of the chest wall during respiratorv maneuvers 
and also speech. Hearing-impaired speakers were found to be like hearing 
speakers in some respects but not in others. For example, respiratory 
activity for non-speech activities such as tidal breathing was similar to 
normal. This has also been noted for other non-speech respiratory activities 
such as coordinative demands on the breathing mechanism for athletics. Also, 
Forner and Hixon (1977) showed that the mechanical adjustments of the 
respiratory mechanism in preparing to speak (i.e., the relative posture, of the 
rib- cage versus the abdomen) were often correct. These findings do not 
support the suggestibns of the early researchers that hearing-impaired speak- 
ers evidence inappropriate . posture problems such as rib cage-abdominal asyn- 
chronization. However, like the early researchers, Fornervand Hixon reported 
that hearing-impaired speakers paused at inappropriate linguistic boundaries 
either to inspire or alternatively to waste air, and thus they produced fever- 
syllables per breath unit. Hearing-impaired speakers were also found to 
initiate phonation at inappropriate lung volumes and to speak within a fairly 
restricted lung volume range. 

These results have been confirmed by Whitehead ' (in press), who has 
extended the findings of Hixon by examining different respiratory P atte ™* 
with respect to the speech intelligibility of hearing-impaired talkers. Not 
surprisingly, Whitehead showed that profoundly hearing-impaired speake.-s who 
were more intelligible had respiratory patterns similar to those o' normal 
speakers. For example, both groups initiated speech well funettonjl 
residual capacity (FRC) and terminated production within the mid-volume range. 
In contrast- hearing-impaired speakers who were characterized as semi- ■ 
taUUiSwi Initilwd speech at substantially lower lung volumes ana 
' continued speaking well below FRC. Speech attempted at such reduced lung 
volumes is exceedingly difficult because the speaker is hot king against the 
natural recoil forces of the respiratory mechanism. Furthermore, this 
aberrant respiratory pattern will also directly affect phonation. 

Control of the expiratory cycle for .speech is crucial for phonation and 
is particularly important in producing events such as changes in vocal 
intensity, accommodating different aerodynamic patterns WMiated with conso- 
nant production, as well as linguistic phrasing. To achieve such J?**™ 
events, the volume of expired air must be appropriately managed, and this 
usually occurs at the laryngeal level. Thus, during speech Production one 
might "think of the relationship of the larynx to the respiratory mechanism as 
* analogous to that of an air valve, where the valve must be open at certain 
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times to let the air escape ($.g., when producing a voiceless segment), and 
must be closed at the other times to preserve the breathstream. 

There are data suggesting that hearing-impaired speakers have difficulty 
in coordinating the events of respiration and laryngeal valving. For example, 1 
consider some aerodynamic studies of consonants produced by hearing-impaired 
speakers (Hutchinson 4 Smith, 1976; Whitehead, in press; Whitehead & Barefoot, 
1980). The method of data collection in these studies was similar; air flow 
was measured using a face mask coupled to a pneumotachograph. Air flow 
measurements are taken to reflect .the relative open or* closed state of the 
vocal tract. For normals, voiceless plosives would be produced with greater 
peak airflow than 'their voiced cognates; fricatives would be produced with 
greater airflow than plosives. Overall, Whitehead and others cited above have 
shown that hearing-impaired speakers do produce, although inconsistently, 
plosives and fricatives with normal airflow patterns, suggesting that at least 
some hearing-impaired speakers are relatively successful in coordinating 
respiration and laryngeal valving. Not surprising, these speakers were among 
the more intelligible in the Whitehead study. Less intelligible' hearing- 
impaired speakers were often quite variable in management of airflow, and they 
did not differentiate- voiced and voiceless cognates aerodynamically . Data 
from these subjects suggest inappropriate laryngeal gestures that could reduce 
airflow or, in other words, an inability of some hearing-impaired speakers to^ 
coordinate respiration and laryngeal valving. 

Another example of laryngeal valving problems can be gleaned from a study 
of laryngeal-supralaryngeal coordination in the speech of the hearing impaired 
(McGarr & LBfqvist, in press). In this experiment, laryngeal activity was 
monitored by means of , transillumination, wherein a flexible fiberscope is used 
to illuminate the larynx, and a phototransistor , placed on the surface of the 
subject's neck below the cricoid cartilage, senses the light passing -between 
the vocal folds. Figure 8 shows selected tokens of an utterance produced by 
one profoundly hearing-impaired speaker. Information about laryrgeal abduc- 
tion/adduction is shown in the transillumination records. Evidence of inap- 
propriate glottal abduction/adduction gestures is noted preceding each test 
word, and also between words in the carrier phrase. Figure 9 shows represen- 
tative samples from a second profoundly hearing-impaired speaker's production 
of the same test words. Similar inappropriate glottal gestures between words 
are again observed. Leaving interarticulator timing for a later discussion, 
this pattern supports the notion of valving problems at the laryngeal level 
consistent, with previous discussions. During pauses between words, these 
hearing -impaired speakers inappropriately opened the glottis, a pattern never 
observed in the production of hearing speakers. Whether these hearing- 
impaired speakers actually took a breath or simply wasted air cannot be 
directly ascertained from these data since simultaneous monitoring of Respira- 
tory activity was not done. However, the authors argue that the latter is 
more likely since the glottal abduction gesture was smaller and shorter in 
duration between'- words than - between utterances. This pattern of aberrant 
laryngeal valving differs from one hypothesized by Stevens, Nickerson, and 
Kollins (in press) . Based on a spectrographic study of deaf children 1 s 
productions, these authors hypothesized that the glottis is closed during 
pauses between "words. 

c? 
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Figure 8. Records of a profoundly hearing-impaired speaker's production of 
the utterance "Say peal again" <left) and "Say beak again" (right). 
Curves represent oral articulation (top) t transillumination 
(middle), and audio envelope (bottom). Onset of labial closure for 
the word initial stops in "peal 11 and "beak" is marked a t release of 
oral closure by ▲ ; 4> marks peak glottal opening. Examples of 
inappropriate abduction/adduction gestures are noted In the 
transillumination record by t. 
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Figure 9. 



Record of a second profoundly hearing-impaired speaker's produc- 
tions of the ' utterances "Say peal again" (left), and "Say beak 
again" (right). Symbols as in Figure 8. 
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B. Phonation 

The larynx serves as the primary source of acoustife energy for speech and 
plays an integral role in changes of stress and intonation, and also voicing 
information- While we have jnoted earlier in this chapter that hearing- 
impaired speakers exhibit grpat difficulty in controlling these phonatory 
parameters, there are few physiological studies of laryngeal, function in the 
. hearing impaired- For convenience of discussion, we will divide laryngeal 
function into two areas: phonatory and articulator y. 

To date, there are few studies that have examined the basic phonatory 
mechanism in hearing-impaired speakers. One study (Monsen, Engebretson, & 
Vemula, 1978) examined the glottal volume-velocity waveforms of hearing- 
impaired speakers using a reflectionless (Sondhi) tube. In this procedure, 
the subject phonates a neutral vowel into the tube, and a microphone 
positioned in the tube records a* pressure waveform that is considered to be an 
approximate of the glottal waveform. It should be noted that the use of the 
Sondhi tube presents some problems in the study of both normal and pathologi- 
_ caL voice production., In order to provide an accurate estimate of the source 
waveform, several conditions must be met. For example, the vocal tract itself 
must have a uniform area function and not contain any side resonators such as 
the nasal passages. Since inappropriate nasal resonance is a common problem 
in the speech of the hearing impaired, data obtained using this measurement 
technique should be interpreted cautiously. Monsen et al. (1978) reported 
that an individual glottal pulse , for a "hearing-impaired speaker was not 
abnormal £er se, but that differences between hearing-impaired and hearing 
subjects were seen for successive changes of the glottal waveform from one 
period to another. Glottal waveforms of hearing- impaired speakers also showed 
evidence of diplophonia and creaky voice. Thus, the authors hypothesized that 
hearing-impaired speakers have difficulty controlling overall tension of the 
vocal folds and also sub-glottal pressure. 

Secondly, high-speed laryngeal films have also provided evidence or 
abnormal laryngeal function in hearing-impaired speakers (Metz, Whitehead, & 
Mahshie, 1982). Films of several profoundly hearing-impaired speakers show 
evidence of inappropriate positioning of the vocal folds prior to the onset of 
phonation and subsequent -patterns of abnormal vocal fold vibration. For 
example, ar>. abnormally high amount of medial compression on the arytenoid 
cartilages was observed in the films of one hearing- impaired speaker, and only 
the anterior one-third of the fotds vibrated freely. The analysis of these 
filnu also revealed that some hearingr impaired speakers do not use appropriate 
abduction/ adduction gestures in producing VCV utterances where C was a 
voicele. consonant. These data speak to the point of difficulty in laryngeal 
articul on, that is, the production of segments requiring control and 
coordin hi of the larynx. 

Laryngeal articulation in the speech of the hearing impaired has been 
examinee in two physiological studies; the first a fiberoptic study of voiced 
and voiceless segments (Mahshie, 1980) and the second a transillumination 
study of obstruent production described previously (McGarr & Lofqvist, in 
press). We have noted in other sections of this chapter that there is 
considerable evidence from the descriptive and acoustic literature to suggest 
/ that hearing- impaired speakers have great difficulty coordinating laryngeal 
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and oral- articulatory gestures. One common problem that illustrates this 
difficulty's confusion of the voiced-voicelefs distinction. 

Let us consider what is required in the production of a voiceless 
obstruent (a plosive, fricative or affricative) in the speech of normals. In 
addition to the supralaryngeal' adjustments used to make the closure or 
constriction, a laryngeal abduction 'adduction gesture normally occurs to stop 
glottal vibration and assist in the build-up of oral pressure. Production of 
these segments ~ thus involves simultaneous activity at both laryngeal and 
supralaryngeal levels, and the laryngeal and oral articulations must be 
cQordinated in time. Variations in this timing are used in a wide variety of 
languages to produce contrasts of voicing and aspiration (cf. Lttfqvist & 
- Yoshioka, 1981). 

An example of how chis interarticulator timing might be manifested in the 
speech of a hearing subject' is shown in Figure 10. Data are taken from the 
transillumination study of McGarr and Lttfqvist described above. At this 
time, we focus on the temporal relationship between the oral and laryngeal 
events. Peak glottal opening for the voiceless /p/ in "peal," shown in the 
transillumination signal (middle record) occurs at .about the same time as the 
end of lip closure (top record) and the burst-release in the acoustic envelope 
(bottom record). The pattern for this plosive is essentially the same as that 
obtained for other hearing speakers' production of obstruents in different and 
unrelated languages (Lttfqvist & Yoshioka, 1981), For production of the 
voiced /b/ in "beak," there is no evidence of glottal opening in the 
i transillumination signal as would be expected for a correct production of this 
segment. \ 

Figure 11 is\an example of a common voiced for voiceless substitution in 
• the speech of a hearing-impaired talker. In this example, the error is due to 
inappropriate positioning of the vocal folds. For production of the /p/, the 
transillumination signal shows no evidence of a glottal opening following the 
onset of lip closure, or any evidence of a burst -release in the acoustic 
signal. Indeed,* listeners judged this production to be a /b/ for /p/ 
substitution. McGaVr and Lttfqvist reported that hearing-impaired speakers 
differed from normal by either emitting the glottal gesture entirely as 
illustrated above, or by producing a glottal gesture when none was required 
(see also above, figures 8 and 9) . In fact, one speaker consistently 
differentiated plosives from fricatives by producing the former' without a 
glottal gesture, but the latter with aji opening gesture. However, even when 
an appropriate laryngeal gesture was made by the hearing-impaired subjects, 
the timing relative to the oral articulatory events was not normal. Similar 
observations on the nature of laryngeal articulation have been made by Mahshie 
U98CO. The data from these two studies suggest that hearing-impaired 
speattfirs have difficulty in coordinating the temporal and spatial demands of 
different articulators. We now turn to some evidence that shows that this 
difficulty in coordination also occurs at the articulatory level. 

C. / Articulation 

/ Articulatory errors in the speech of the hearing-impaired have been 
reviewed above. The error patterns described in the literatyre_ suggest 
several Hypotheses concerning the physiological "underpinnings" of articula- 

26^ 

EEL '• .272 



peal 



beak 




Figure 10. Records of a hearing speaker's productions of the utterances "Say 
peal again" (left) and "Say 'beak again" (right). Symbols as in 
Figure 8. (After McGarr & Lfl'fqvist, in press) • 
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Figure 11. Records of a third profoundly hearing-impaired speaker's production 
of the utterances "Say peal again" (left), and "Say beak again 11 
(right). Symbols as in Figure 8. . This example illustrates the 
common b/p substitution. (After McGarr & Lflfqvist, in press). 
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. tion in the hearing impaired. One hypothesis, derived primarily from studies 
of consonant production, suggests that hearing-impaired speakers place their 
articulators fairly accurately but fail to coordinate interarticulator move- 
ments. These errors may be broadly characterized as errors in timing. 
Another hypothesis primarily concerned with vowel articulation is that hearing- 
impaired speakers move their articulators through a relatively restricted 
range, thereby "neutralizing" vowels. Again, there have been relatively few 
physiological studies of articulation in hearing-impaired speakers, three 
electromyographic investigations (Huntington et al., 1968; McGarr & Harris, 
1980; Rothman, 1977), and two cinef luorographic studies (Stein, 1980; Zimmer- 
man & Rettaliata, 1981). These investigations provide some insight into the 
complelc nature__of__articulatory errors in the hearing impaired. 

For example, electromyographic studies of the speech of hearing-impaired 
persons give ample evidence of instability of production and failure to 
achieve the tight temporal coupling in articulatory muscles. McGarr and 

Harris (1980) have shown that for normal speakers, thd relationship between 

tw^^tictrl^tor^_th^_lips (orbicularis oris) and the tongue (genioglossus) , 
is closely coordinated in time, and that even changes in stress from one 
syllable to another do not disrupt th^s temporal relationship. Indeed, this 
closely timed interarticulator relationship seems to be characteristic of 
normal speech production and is evidenced in many articulatory. muscles across 
changes in stress as well as speaking rate (Tuller, Harris, & Kelso, 1981). 

Figure 12, taken from electromyographic records of a hearing speaker in 
the McGarr and Harris experiment, illustrates this temporal relationship. 
These productions are contrasted in Figure 13 with several examples taken from 
" the records of a hearing-impaired speaker. Clearly, these tokens demonstrate 
considerable variability on the part of the hearing-impaired speaker in 
coordinating the activity of the tongue with the lips. Occasionally, tongue 
activity was timed relatively correctly with respect to lip activity. Most 
often, the hearing- impaired speaker initiated this tongue activity either too 
early or too late relative to the lips. These samples suggest that the 
hearing-impaired speaker does not produce a "wrong" pattern in a stereotypic 
way; rather, productions are variable from token to token not only for 
utterances perceived as correct, but also for utterances perceived as incor- 
rect. It is interesting that this variability in production is observed 
primarily in the lingual rather than the labial component, that is, it is the 
less visible aspect of articulation that varies. Similar observations have 
been made regarding phoneme visibility in earlier EMG studies (Huntington et 
al 1968; Rothman, 1977). However, observations on the variability in both 
perceptually correct and incorrect productions clearly provide new insights 
into the organization of the speech of hearing-impaired talkers. 

Cinefluorographic studies (Stein, 1980; Zimmerman & Rettaliata, 1981) 
provide additional informati.n on upper articulatory movements in heanng- 
x impaired speakers. These X-ray films have been analyzed for an adventitiously 
hearing-impaired speaker in the former study, and also for five pre-lingually 
hea^inl-impaired adults in the latter work. Despite differences in onset of 
hearing loss, these subjects showed patterns of articulatory dynamics similar 
to each other, and not unlike normals in many respects. This is not 
surprising since all of the hearing-impaired speakers were at least partially 
in.tellig:ble. Some of ' the differences between normal and heann 2 ~impaired 
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Figure 12. Ensemble average of the EMG potentials for genioglossus (GG) and 
orbicularis oris (00) for the utterance (apapip) produced by a 
hearing speaker. Stress occurs on V-j in 5a, or \fe in 5b, 
) respectively. The vertical line indicates the acoustic release of 
the /p/ closure. Peak genioglossus activity for the vowel occurs 
at about the same time as the acoustic burst release (after McGarr 
& Harris, 1980). 
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Figure 13. Three selected examples of the EMG potential for the genioglossus 
and orbicularis oris for the utterance /apapip/ produced by a 
profoundly hearing-impaired speaker. The vertical line indicates 
the acoustic release of the /p/ closure* In Figure . 13a, peak 
genioglossus activity occurs between the second and third orbicu- 
laris oris peaks, but it is late relative to the acoustic event. 
This pattern was most like normal. In' Figures 13b and . 13c, the 
, single tokens show that genioglossus- activity was either too late 
or too early, respectively (after McGarr & Harris, 1980). 
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speakers were as follows ♦ Hearing-impaired speakers frequently exhibited 
faster articulatory speeds for lip t tongue, and jaw movements, and articylato- 
ry displacements were often of greater magnitude than for normal?. Vowel 
height differentiation was achieved primarily by jaw movements, and deviant 
positioning of the tongue, with primarily "fronting" of back vowels noted • A 
consistent finding of these . studies was that onset and offset' voicing in 
consonant production was frequently too long. These physiological data agree 
with descriptive studies on voicing errors, particularly that of Millin 
(1971). These results reaffirm the notion that interarti^culator coordination 
is poorly controlled .by hearing-impaired speakers. 

D. , Summary 

Taken tQgether, these studies suggest that the physiological correlates 
of segmental and suprasegmental errors in the speech "of the hearing impaired 
are exceedingly complex. Our knowledge of the physiology of speech production 
by the hearing impaired is still in the germinal stages. While the research 
described above has included only relatively few hearing-impaired speakers, 
and caution mustvbe taken not to overgeneralize results, several interesting 
mechanismB'Of production are beginning to emerge. ■ 

One is that certain physiological characteristics of the production of 
hearing-impaired speech may span an entire utterance, and thus cannot be 
accurately ascribed to either segmental or suprasegmental attributes' of 
speech. These have* been termed postural characteristics by Stevens and his 
colleagues (Steven's, Nickerson, & Rollins, '1978, in press). Examples of 
postural errors - would include inappropriate respiratory control, glottal 
abduction/adduction gestures, vocal fold tension and ma'ss, tongue position and 
\zr\ge of movements, velopharyngeal posture and movements. These postural 
characteristics include riot only the preparatory state for speaking, but also 
the configuration of the^speech production mechanism ove time. We have noted 
several examples in the preceding discussion that* suggest that hearing- 
impaired ^speakers evidence such inappropriate postures. 

• The importance of postural characteristics has also been highlighted 

recently in studies of speech production in normals. Parallels between 

coordinated non-speech and speech activities have been drawn. For .example, a 

non-speech activity such as locomotion is said to be like speech in that both 

may be thought of as having a series of rapid", rhythmic, and highly coordinate 

movements superimposed on a broad posture base. We might think then of speech 

as a complex and rapidly changing articulatory-phonatory process overlayed on 

a slowly changing respiratory base. Thus, the hearing- impaired speaker who 

adopts an inappropriate respiratory posture for whatever reason may preclude 

the coordination and control of movement elsewhere in th * speech production 

mechanism. An inappropriate respiratory posture may be further exacerbated by 

inappropriate glottal gestures, or inappropriate tongue position, and so on. 

«# * 

A second problem evidenced by many hearing-impaired speakers is .great 

difficulty in ^coordinating/ respiration, phonation, and articulation. In 

normal 'speech production, tYte .tight temporal coordination of these events 

constitutes an important component in any theory of speech production. In the 

speech' production of ' the hearing impaired, we have ample evidence for a 

breakdown in interarticulator coordination, for example, in the studies of 

# 
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aerodynamics, laryngeal-supralaryngeal coordination, and articulation cited 
above. These data suggest not only difficulty accommodating the demands of 
speech in space and time,. but also substantial variation in production from 
utterance to utterance. Without such coordination, intelligible speech is f 
impossible and taken together, these factors suggest some reasons why lis- 
teners find the speech of the hearing impaired so difficult to understand. 

Neither problems of postural characteristics nor those of interarticula- 
tory coordination are mutually exclusive. Physiological research focusing on' 
several levels of speech production may prove fruitful in clarifying many of 
the errors docunented in the descriptive literature. A better understanding 
of these problems at the physiological level will' hopefully lead to the 
development of, more effective assessment techniques and training programs for 
hearing- impaired speakers. . , 

VII. SPEECH INTELLIGIBILITY 

We shall use the term "speech intelligibility" to refer to how much of 
what- a child says can be understood by a listener. On the average, the 
intelligibility of profoundly hearing- impaired children's speech is very' poor. 
Only about one'in every five words they say can be understood by a listener 
who is unfamiliar with the speech of this group (Brannon, 1966; John & 
Howarth, 1965; Mar kid es, * 1970; McGarr, 1978; Smith, 1975). 

Before we proceed to a discussion of factors that have been found to 
affect intelligibility, some comments on analysis techniques are necessary. 
First, intelligibility measures in most studies have been based only on a 
listener's auditory judgments of a child* s productions. While this approach, 
may be the most appropriate for quantifying the intelligibility of speech, it 
does not necessarily provide an accurate assessment of a child's ability to 
communicate in a face-to- face situation. 

A second point that should be made is that' the majority of investigators 
who have attempted to determine the effect of specific variables on intelligi- 
bility have done so using a correlational analysis, a statistical analysis of 
the association between the factor of interest and the reduction in intelligi- 
bility. Correlations should be interpreted carefully because a cause and 
effect relationship cannot be inferred from the results. Several studies tnat 
have been, per formed will 'be presented in some detail in this section. 

A. Hearing Level 

A review of the literature indicates that an important factor ..in 
determining the intelligibility of a hearing-impaired child's speech is the 
degree of the child's hearing loss (Boothroyd, 1969; Elliot, 1967; Mar ^es, 
19?S! Montgomery, 1967; Smith. 1975). Boothroyd (1969) found a correlation 
between percent intelligibility scores. and hearing level at all rrequencies. 
particularly at 1000 Hz and 2000 Hz, for a population of h h " r «8-^ re J 
children from the Clarke School .for the Deaf. In fact, the data formed a 
bimodal distribution: the" children with good speech intelligibility (^elli- 
Ribilitv score of 70* or more) had. considerable hearing, while those children 
with poor ^ intelligibility (70*. or less) had little residual hearing. The 
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median. hearing level of the group with good speech intelligibility was 90 dB 
and f as the hearing loss exceeded 90 d3 at 1000 Hz, the median speech scores 
fell rapidly. In another study that analyzed the speech intelligibility of 
profoundly hearing-impaired children, Staith (1975) observed a systematic 
decrease in intelligibility with poorer hearing level to a level of about 85 
dB HTL f after which the relationship was not clear. Monsen (1978) found that 
all the children he studied -with hearing losses of 95 dB HTL or less had 
intelligible speech, but children with losses greater than 95 dB HTL did not 
always have poor or unintelligible speech. These data indicate that even 
though a child has a profound hearing loss, he or; she still has the potential 
to develop functional speech skills. 

Two studies of interest are those by Staith (1975) aud Gold (1978), which 
were described in the preceding section. Recall that the same test materials 
and procedures were used in the two studies to assess the speech of different 
populations of hearing-impaired children. The average intelligibility of the 
profoundly hearing-impaired children's speech in an oral day school for the 
deaf was reported by Staith to be about 19* • Gold (1978) reported an average 
intelligibility score erf 39X for the mainstreamed profoundly hearing-impaired 
children assessed in her study. Thus, children with similar hearing levels in 
different educational settings showed an average difference of 20X in their 
intelligibility scores. Not unexpectedly, research has shown that the intel- 
ligibility of hard-of-hearing children! s speech is substantially higher than 
that of profoundly hearing- impaired children. Average intelligibility scores 
of 70-76'X have been reported for the hard of hearing (Gold, 1978; Markides f . 
1970). 

Higher intelligibility scores than those mentioned above have been 
reported by Monsen (1978). His results revealed an average intelligibility 
score of 91% for severely hearing- impaired children, and a score of 76% for 
the profoundly hearing- impaired children in his study. Monsen (1978) hs' 
attributed the difference in intelligibility scores between his and other 
studies to differences in the speech material that the children were required 
to produce. According to Monsen (1978), the sentences in his study were 
shorter, contained a more familiar vocabulary, and were syntactically less 
complex than those used by other investigators. In fact, McGarr (1980) has 
shown that intelligibility scores for hearing-impaired speakers may vary 
considerably depending on speech material (sentences or words) , amount of 
context, phonetic composition, and, of course, experience of the listener. 

The above studies indicate that although the degree of hearing loss is an 
important variable, this measure alone cannot reliably predict the intelligi- 
bility of a child's spfeech. In faot, in a study by Staith (1975), hearing 
level was found to be only a fair predictor of the speech intelligibility of 
profoundly hearing-impaired children. The hearing measure found to be most 
closely correlated with speech intelligibility was performance on an auditory 
phoneme recognition test. This finding suggests -that it is not hearing level 
per se that -is most important for the development of intelligible speech, but 
rather the ability of the hearing-impaired child t9 make use of the acoustic 
cues that are available to him or her. 
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B, Segmental Errors 

It has generally been found that as the overall frequency of segmental or 
phonemic errors increases in the speech of the hearing impaired, intelligibil- 
ity decreases (Brannon, 1966; Gold, 1978; Hudgins & Numbers, 1942; Markides, 
1970; Smith, 1975). However, the- number of segmental errors alone cannot 
account for reduced intelligibility. Smith (1975), for example, observed that 
some of the subjects in her study who had approximately the same frequency of 
segmental errors had speech intelligibility scores differing by as much as 30 
percent. Smith hypothesized that these differences appeared to be related, in 
part, to certain suprasegmental errors that interacted in a complex manner 
with the segmental errors to reduce intelligibility. 

The relationship between specific types of segmental errors and intelli- 
gibility has been examined to some extent by Hudgins and Numbers (1942) and 
later by .Smith (1975). In their classic study, Hudgins and Numbers found a 
nigh negative correlation between intelligibility and total number of vowel 
errors (-.61) and total number of consonant errors (-.70). Similar -esults 
were reported by Smith, except that she found a slightly higher correlation 
between vowel errors and intelligibility than did Hudgins and Numbers. 

Of the seven consonant error categories considered in the Hudgins and 
Numbers (1942) study,, three categories (omission of initial consonants, 
voiced- voiceless confusions, and errors involving compound consonants) had the 
most significant effect on intelligibility. The other four categories 
considered (substitution errors, nasality errors, omission of final 
consonants, and errors involving abutting consonants) had a lower correlation 
with intelligibility and contriouted to a much lesser extent to the reduced 
intelligibility of hearing- impaired children's speech: . 

In a recent study. Monsen (1978) examined the relationship between 
intelligibility and four acoustically measured variables of consonant produc- 
tion three acoustic variables of vowel production, and two measures ot 
prosody. A multiple regression analysis showed that three variables had a 
high multiple correlation (.85) with intelligibility and thus accounted for 
73% of the variance: (1) the difference in voice-onset tire between /t/ and 
/d/* (2) the difference in second formaht location between /i/ and /I/£and 
(3)' acoustic characteristics of the nasal and liquid consonants. The first 
two variables accounted for almost 69* of the variance. 

Other segmental errors that have been observed to have a significant 
negative correlation with intelligibilUy are: omission of phonemes in the 
word-initial and medial position; consonant substitutions involving a change 
in the manner of articulation; substitutions of non-English phonemes such as 
the glottal stop, and unidentifiable or gross distortions of the intended 
phoneme (Levitt "et al., 1980). 

C. Suprasegmental Errors 

The suprasegmental errors examined most extensively in relation to 
intelligibility have been those involving timing. One of tne earliest 
attempts to determine * the relationship between deviant timing |»tterns and 
intelligibility is found in the study by Hudgins and Numbers (1942). Although 
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they correlated rhythm errors with intelligibility, many ;f these errors 
appear to be due to poor timing control. They found that sentences spoken 
with correct rhythm were substantially more intelligible than those that were 
not. The correlation between speech rhythm and intelligibility was .73, 
which was similar to the correlation between total consonant errors and 
intelligibility, and higher than the correlation found between vowel errors 
and intelligibility. 

¥ 

The results of other correlational studies have typically shown a 
moderate negative correlation between excessive prolongation of speech seg- 
ments and intelligibility (Monsen & Leiter, 1975; Parkhurst h Levitt, 1978) • 
In a recent study, Reilly (1979) found N that relative duration 
(stressed .'unstressed syllable nuclei duration ratio) demonstrated a systematic 
relationship with intelligibility. Reilly (1979) suggested that the better 
able the profoundly hearing-impaired speaker was to produce the segmental, 
lexical, and syntactic structure of the utterance, the more intelligible the 
utterance was likely to be. Data reported by Parkhurst and J-evitt (1978) 
indicate that another type of timing error, the insertion of short pauses at 
syntactically appropriate boundaries, had a positive effect on 
intelligibility; the presence of these pauses actually helped to improve 
intelligibility. 

Studies that have attempted to determine the cause and effect relation- 
ship between speech errors and intelligibility have dealt primarily with 
timing. These causal studies can be sub-divided into two major categories: 
studies in which hearing-impaired children receive intensive training for the 
correction of timing errors, and studies in which timing errors are corrected 
in hearing-impaired children's recorded speech samples using modern signal- 
processing techniques. 

The classic training study that attempted to determine the causal 
relationship between timing errors and intelligibility was conducted by John 
and Howarth (1965). These investigators reported a significant improvement in 
the intelligibility of profoundly hearing-impaired children's speech after the 
children had received intensive training focusing only on the correction of 
timing errors. In contrast to this, Houde (Note 2) observed a decrement in 
intelligibility when timing errors of hearing-impaired speakers were correct- 
ed, and the results of a similar study by Boothroyd et al. (19 7l O were 
equivocal* 

A major problem with the training studies is that the training may result 
in changes in the child's speech other than those of interest. Recent 
investigations have attempted to eliminate this confounding variable by using 
computer processing techniques. In these studies, speech is either synthe- 
sized with timing distortions, or synthesized versions of the speech of the 
hearing impaired are modified so that timing errors are corrected. Lang 
(1975) used an analysis-synthesis approach to correct timing errors in tjhe 
speech samples produced by hearing-impaired speakers, and also to introduce 
timing distortions in the samples of normal speakers. Minimal improvements in 
intelligibility were observed for the speech of the hearing impaired, and 
minimal decrements in intelligibility were observed for the normal speakers. 
Bernstei.i (1977), however, found no reduction in the intelligibility of speech 
samples .produced by a normal speaker when synthesized with timing errors.. On 



the other hand, Huggins (1978) found that when normal speech was synthesized 
with the durational relationship between stressed and unstressed syllables 
reversed, there was a substantial reduction in intelligibility. Even greater 
' reductions in intelligibility occurred when the stress assignments for both 
pitch andduration were incorrect. 

In an attempt to resolve some of the conflicting information in this 
area, Osberger and Levitt (1979) quantified the relative effect of timing 
errors on intelligibility by means of computer simulation. Speech samples 
produced by gearing-impaired children were modified to correct timing errors 
only, leaving all other aspects of the speech unchanged. Three types of 
correction were performed: relative timing, absolute syllable duration, and 
pauses. Each error was corrected alone and together with one of the other 
timing errors. An average improvement in Intelligibility was observed only 
when relative timing errors alone -were corrected. The improvement, however, 
was very small (4%). Since the timing modifications for th<a condition 
involved only the correction of the duration, rjtio for stressed-to-unstressed 
vowels, the overall durations of the vowels (and syllables) were still longer 
than the corresponding durations in normal speech. These data indicate .that 
the prolongation of syllables and vowels, which is one of the most obvious 
deviancies of the speech of the hearing impaired, does not in itself have a 
detrimental effect on intelligibility . 

Attempts have also been made to determine the relationship between errors 
involving fundamental frequency (Fo) control and intelligibility. Monsen 
(1978) found that there was no clear-cut relationship between mean Fo and mean 
amount of Fo change and intelligibility. In their study, McGarr and Osberger 
(1978) found that, for the majority of the children st ud ltd, there feem ed to 
be no simple relationship between pitch deviancy and intelligibility. Some 
children whose pitch was judged appropriate for their age and . sex had 
intelligible speech, while others did not. The exception to this pattern were 
the children who were unable to sustain phonation and whose speech contained 
numerous pitch breaks. Their speech was consistently judged to be unintelli- 
gible. Smith (1975) also found that errors involving poor phonatory control 
(intermittent phonation." spasmodic variations of pitch and loudness, and 
excessive variability of intonation) had a high correlation with intelligibil- 
ity . 

Data obtained by Parkhurst and Levitt (1978) also suggest that excessive 
variations in pitch may reduce intelligibility. In this study, a multiple 
linear regression analysis was performed ." relating intelligibility to various 
prosodic distortions judged to occur in the speech of hearing- impaired 
children.. Breaks in pitch were one ^of the prosodic errors showing a 
significant negative regression with intelligibility. The effect of the less 
deviant patterns, such as elevated Fo. has not been clearly established, 
although preliminary data suggest that these problems will not have a serious 
effect on intelligibility. 

In summary, we have relatively little information regarding the effect of 
- errors, or combination of errors, on the intelligibility of hearing- impaired 
• children's speech, nor are we able to predict reliably if a child has the 
potential to develop intelligible speech. Some background variables appear, to 
t£ important, such as the hearing status of the parent, while others, such as 
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age of identification of hearing impairment, hearing aid use, start of special 
education; IQ f and the hearing status of siblings show little or no correla- 
tion with speech intelligibility (Smith, 1975) • 

< 

VII. CONCLUDING COMMENTS 

We shall now summarize some of the major points discussed in the chapter, 
and discuss the implications of the available data for the development of 
assessment and training techniques. On the basis of the data presented, the 
following statements can be made regarding the speech production skills of 
hearing-impaired children: 

1. Rate of vocal output cannot be used to describe accurately the 
differences in the vocalization behavior between hearing and hearing-impaired 
infants. Striking differences between the vocalizations of normal-hearing and 
hearing-impaired infants do emerge at an early age, but the differences are 
seen Jn phonemic production rather than rate of vocal output. Specifically, 
hearing-impaired infants tend to produce stereotypic vocalization patterns 
with a .educed phonemic repertoire relative to hearing infants. 

2. The developmental stages of speech acquisition in the hearing im- 
paired appear to be similar to those of normal-hearing children in some 
respects but not in others. Also, the speech production patterns of older 
hearing-impaired children show many similarities to the patterns of the 
younger hearing-impaired children. 

3. Segmental errors, as determined b^ phonetic transcriptions of hearing- 
impaired children's speech can be classified by the following two categories: 

a) Omission Errors: This type of error most often involves consonants, 
particularly those in the word-final position. Omission of vowels is 
infrequent and usually does not occur unless the entire syllable has 
been omitted. 

b) Substitution Errors: Frequent errors in this category involve confu- 
sion between voiced-voiceless cognates, substitution of a consonant 
with the same place of production but a different manner of produc- 
tion as the intended consonant, (and vice versa), and substitution of 
non-English sounds, particularly the glottal stop for the intended * 
phoneme. Vowel errors in this category typically involve tense-lax , 
substitutions, substitutions toward a vowel that is more central than 
the t'arget vowel, and substitution of the schwa vowel for the 

" intended vowel. ' Diphthong errors frequently involve substitutions of 
one of the elements of a closely related vowel. 

4. Errors' are less frequent for consonant phonemes produced at the front 
of the mouth (the labial and labio-dental 'consonants) as compared to phonemes 
with a place of articulation at the middle or back of the mouth. 
Traditionally, this pattern of production has been attributed to the greater^ 
visibility of phonemes produced in th* front of the mouth. Other artiqulatory 
considerations, such as the relatively constrained movements of the most 
visible articulators, the lips, may also account for this production pattern. 

<> 
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5, Similar error patterns have been found to Occur in the speech of 
different groups of hearing-impaired children. The largest difference between 
children is in the frequency of errors; type of error may also vary, but to a 
lesser extent than frequency of errors, 

6. At the suprasegmental level of production, poor timing control 
produces the following deviations: 

a) Prolongation of speech segments 

b) Distortion of temporal relationship between speech segments 

c) Insertion of frequent and lengthy pauses often at syntactically 
inappropriate boundaries 

d) Distortion of phonetic context effects 

e) Insertion of adventitious phonemes. 

Poor control of fundamental frequency can result in problems such as: 

a) Average pitch level too high 

^ b) Intonation with insufficient variability 

c) Intonation with excessive variability. 

Abnormal voice characteristics such as harshness, breathiness, Jiyper- and 
hyponasality may also be present. 

7 Acoustic analyses have shown manifestations of the above perceptual 
errors in the distortion of voice-onset-time, forraant frequency transitions, 
frequency location of the formants, and segmental durations. 

8. Recent studies have begun to detail the physiological correlates of 
segmental and suprasagmental errors. These studies show that the underlying 
causes of error patterns are more complex- than has been alluded to in the 
descriptive literature. Some of the production mechanisms responsible for the 
perceptual and acoustic distortions are poor respiratory control, evidenced by 
initiation of phonation at too low a level of vital capacity and production of 
a reduced number of syllables per breath f abnormal laryngeal function, 
eviderced by laryngeal valving problems and failure to coordinate, laryngeal 
and respiratory evqnts, and a breakdown in interarticulator programming, 
evidenced by poor control and coordination of articulatory gestures, both at 
the laryngeal" and supralaryngeal levels of. production. Improper postural, 
characteristics of the- speech mechanism may affect many aspects of speech 
production and result in segmental and suprasegmental misperc.eptions. 

9. Although there are many deviations in the speech of the hearing- 
impaired, these deviations do not generally occur in a random way. Ther e is 
evidence that many of the deviations are phonetically and Penological y 
consisted albeit the systems may not be the same as those used by/normal- 
hearing calkers. However, the use of a deviant phonological system will still 
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pose problems for the listener who must decode the intended message. Data are 
also available suggesting that hearing-impaired talkers manipulate some seg- 
mental, lexical, and syntactic aspects of speech in the same manner as 
normals. 

10. The intelligibility of the speech of children with profound hearing 
losses in day schools fori the hearing impaired has been reported to be about 
20%. This figure is based on the percentage of words correctly understood 
through audition alone by persons who are unfamiliar with the speech of the 
hearing impaired. Under the same conditions, the intelligibility of the 
speech of children with profound losses who are mainstreamed has been found to 
be about 40%. The intelligibility of the speech of hard-of-hearing children 
is substantially higher than that of severely and profoundly hearing-impaired 
children. 

1T. The intelligibility of hearing-impaired children's speech has been 
found to be influenced by the degree of linguistic context and the experience 
of > the listener with the speech of the hearing-impaired. 

12. The relationship between specific error types and^intelligibility 
has not been clearly established. Correlational studies show a high degree of 

association between the frequency of segmental — errors and reduction in_ 

intelligibility. Of the various error types that have been studied, the 
highest correlations have • been reported for overall frequency- of phonemic 
errors, errors of omission in the word-initial and .medial position, substitu- 
tions -involving a change in the manner of articulation, substitution of non- 
English phonemes* and unidentifiable or other gross distortions of .the 
intended phonemes. . At the suprasegmental level, timing errors and errors 
involving poor phonatory control have been found to have a negative effect on 
intelligibility. 

Although our knowledge" about the speech of the hearing impaired is far 
from "completer rimpTiira^i~o^ 

gleaned from the aforementioned findings. First, hearing-impaired children 
show distinct error patterns, and unless appropriate assessment instruments 
are used; some errors may go undetected. . Second, in addition to assessing 
spe£dt\ structures, clinician? and teachers must attempt to evaluate the 
adequacy of respiratory, laryngeal, and articula'tory maneuvers essential for 
normal speech production. By this, we do not mean to imply that physiological 
measures should be performed routinely in the clinic. Rather, through 
clinical observatiorf and perceptual measures, inferences* can be made about the 
underlying speech production mechanism. .Third, a phonological analysis of an 
individual child's sound system will enable *the clinician to determine if a 
child's speech deviates from normal in a systematic way, or if the errors are 
random. • 

* . * 

Following the evaluation, the clinician or teacher should raise pertinent 
questions regarding each child's error patterns and production skills. Such 
questions include the following: / 
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1. Does the child have a diverse sound system? 

a) Are the basic contrasts, i.e., oral-nasal, stop-continuant, fricative- 
nonfricative, present in the child 1 s sound system? 

b) Are these contrasts pr^sfeqt for the different places of artipulation, 
i.e., front, mid, back? 

c) Is there vowel differentiation, i.e., fronUback contrast, high- low 
contrast? 

d) Are non-English sounds (glottal stop) or unidentifiable sounds fre- 
quently substituted for the intended phoneme? 

*- 

2. Is there adequate contrql of the speech mechansim? 

•» 

a) Is there adequate breath management^ Is the feature of frication 
absent or distorted; -is there evidence of phrase structure, with or 
without a terminal fall in pitch? 

b) Is there poor velopharyngeal control that results in segmental errors 
(substitution of oral sounds for nasal sounds), and an abnormal voice 
quality ( hyper nasality)-? 

c) Is there adequate laryngeal control? Are there excessive changes in 
pitch, are there inappropriate changes in pitch? Are there localized 
changes in fundamental frequency that are not linked appropriately to 

changes in lexical , stress? 

f 

d) Is there coordination between- laryngeal and supralaryngeal movements, 
.i.e., are there voiced- voiceless errors? 

_____ e)__ Is there ^independent contr.olof vowel production, and, pitch control? 

Is there a- noticeable difference in pitch between- productions of low 
vowels a/ and high vowels /i, u/? 

\ 

f) Is there adequate [timing control?" ;Is overall rate too, slow; are 
there adventitious sounds; are there distortions of temporal rela- 
tionships between segments and distortion of phonetic context effects 
iti the temporal d amain; are pauses frequently inserted; is there 
glottal ization? " ' ' • 

0nce^b€3e areas are jaddressed, an optimal training sequence can l?e 
selected to -meet the individual needs of, each child. The effectiveness of the 
training strategies can be assessed through careful and objective^monitoring 
of the.cllUd $ s performance ijn speech therapy. 

' ' ■ . ! REFERENCE NOTES 

» "^z ~ 
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